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5 

<120> Novel polypeptides and methods of their use 
<130> 875.006WO1 

10 <150> US 60/105,575 

<151> 1998-10-26 

<160> 65 

15 <170> FastSEQ for Windows Version 3.0 

<210> 1 
<211> 2265 
<212> DNA 
20 <213> Artificial Sequence 

<220> 

<223> A modified DNA sequence 



25 


<400> 


. l 














gagggtagag 


aaatcgaatg 


ccccttgcat 


caaggtcggt 


ttgacgtttg 


cacaggcaaa 


60 




gccctgtgcg 


cacccgtgac 


acagaacatc 


aaaacatatc 


cagtcaagat 


tgagaacctg 


120 




cgcgtaatga 


ttgatttgag 


ctaagaattt 


taacaggagg 


caccccgggc 


cctagagcgt 


180 




aatcaccccc 


attccatctt 


ttttaggtga 


aaacatgaat 


tacaataata 


aaatcttggt 


240 


30 


aagtgaatct 


ggtctgagcc 


aaaagcacct 


gattcatggc 


gatgaagaac 


ttttccaaca 


300 




tgaactgaaa 


accatttttg 


cgcggaactg 


gctttttctc 


actcatgata 


gcctgattcc 


360 




tgcccccggc 


gactatgtta 


ccgcaaaaat 


ggggattgac 


gaggtcatcg 


tctcccggca 


420 




gaacgacggt 


tcgattcgtg 


cttttctgaa 


cgtttgccgg 


catcgtggca 


agacgctggt 


480 




gagcgtggaa 


gccggcaatg 


ccaaaggttt 


tgtttgcagc 


tatcacggct 


ggggcttcgg 


540 


35 


ctccaacggt 


gaactgcaga 


gcgttccatt 


tgaaaaagat 


ctgtacggcg 


agtcgctcaa 


600 




taaaaaatgt 


ctggggttga 


aagaagtcgc 


tcgcgtggag 


agcttccatg 


gcttcatcta 


660 




cggttgcttc 


gaccaggagg 


cccctcctct 


tatggactat 


ctgggtgacg 


ctgcttggta 


720 




cctggaacct 


atgttcaagc 


attccggcgg 


tttagaactg 


gtcggtcctc 


caggcaaggt 


780 




tgtgatcaag gccaactgga 


aggcacccgc 


ggaaaacttt 


gtgggagatg 


cataccacgt 


840 


40 gggttggacg 


cacgcgtctt 


cgcttcgctc 


gggggagtct 


atcttctcgt 


cgctcgctgg 


900 




caatgcggcg 


ctaccacctg 


aaggcgcagg 


cttgcaaatg 


acctccaaat 


acggcagcgg 


960 



2 





catgggtgtg 


ttgtgggacg 


ycH-cn-n-ciyy 




gcagact tgg 


ttccggaatt 


1020 




gatggcattc 


ggaggcgcaa 


ciyoctyyctciciy 




gaaattggcg 


atgttcgcgc 


1080 




tcggatttat 


cgcagccacc 


I, CLCLVr b^Cu\> 


cgttttcccg 


aacaacagca 


tgctgacctg 


1140 




ctcgggtgtt 


ttcaaagtat 


yyctd^v^v^ya. i* 


cgacgcaaac 


accaccgagg 


tctggaccta 


1200 


5 


cgccattgtc 


gaaaaagaca 


*-y *-* ^ j 3 


tctcaagcgc 


cgcttggccg 


actctgttca 


1260 




gcgaacggtc gggcctgctg 


yuuL^ uy y y a 


aaacaacaac 


aatgacaata 


tggaaacagc 


1320 




ttcgcaaaac ggcaagaaat 


atcaaucciciy 


aaataataat 
ay ct i- ciy w y c* 


ctgctttcaa 


accttggttt 


1380 




cggtgaggac 


gtatacggcg 


aCyt-yy Ut^ L.a 


tccaaocatc 


gtcggcaaat 


cggcgatcgg 


1440 




cgagaccagt 


tatcgtggtt 


cccaccyggc 




cacgtcagca gctccaactg 


1500 


tO ggctgagttc 


gagcatgcct 


ctagtacttg 




cttacgaaga 


ctactgatcg 


1560 




ctaacagacg 


agtcgaccat 


gatgatcaat 


atCCaayaciy 


acaagctggt 


ttccgcccac 


1620 




gacgccgaag 


agattcttcg 


tttcttcaat 


tgccacgact 


ctgctttgca 


acaagaagcc 


1680 




actacgctgc 


tgacccagga 


agcgcatttg 


ccggacaccc 


aggcttaccg 


tgcttggtta 


1740 




gagcactgcg 


tggggtcaga 


ggtgcaatat 


caggtcattt 


cacgcgaact 


gcgcgcagct 


1800 


15 


tcagagcgtc 


gttataagct 


caatgaagcc 




acaacgaaaa 


ttttcagcaa 


1860 




ctgaaagttc 


gagttgagca 


tcaactggat 


ccgcaaaact 


ggggcaacag 


cccgaagctg 


1920 




cgctttactc 


gctttatcac 


caacgtccag 


gccgcaatgg 


acgtaaatga 


caaagagcta 


1980 




cttcacatcc 


gctccaacgt 


cattctgcac 


cgggcacgac 


gtggcaatca ggtcgatgtc 


2040 




ttctacgccg 


cccgggaaga 


taaatggaaa 


cgtggcgaag 


gtggagtacg 


aaaattggtc 


2100 


20 cagcgattcg 


tcgattaccc 


agagcgcata 


cttcagacgc 


acaatctgat 


ggtctttctg 


2160 




tgattcagtg 


accattttta 


caaatggtca 


ctgcaaccgc 


ggtcaccatt 


aatcaaaggg 


2220 




aatgtacgtg 


tatgggcaat 


caacaagtcg 


tttcgataac 


cggtg 




2265 



<210> 2 
25 <211> 449 

<212> PRT 

<213> Artificial Sequence 
<220> 

30 <223> A polypeptide encoded by SEQ ID N0:1 

<400> 2 

Met Asn Tyr Asn Asn Lys He Leu Val Ser Glu Ser Gly Leu Ser Gin 
15 10 15 

35 Lys His Leu He His Gly Asp Glu Glu Leu Phe Gin His Glu Leu Lys 
20 25 30 

Thr He Phe Ala Arg Asn Trp Leu Phe Leu Thr His Asp Ser Leu He 

35 40 45 

Pro Ala Pro Gly Asp Tyr Val Thr Ala Lys Met Gly He Asp Glu Val 
40 50 55 60 



He Val Ser Arg 
65 

Cys Arg His Arg 

5 Lys Gly Phe Val 
100 

Glu Leu Gin Ser 
115 

Asn Lys Lys Cys 
10 130 

His Gly Phe He 
145 

Asp Tyr Leu Gly 

15 Ser Gly Gly Leu 
180 

Ala Asn Trp Lys 
195 

Val Gly Trp Thr 
20 210 

Ser Ser Leu Ala 
225 

Gin Met Thr Ser 

25 Tyr Ser Gly Val 
260 

Gly Gly Ala Lys 
275 

Ala Arg He Tyr 
30 2 90 

Ser Met Leu Thr 
305 

Ala Asn Thr Thr 

35 Pro Glu Asp Leu 
340 

Gly Pro Ala Gly 
355 

Ala Ser Gin Asn 
40 370 



Gin Asn Asp Gly 
70 

Gly Lys Thr Leu 
85 

Cys Ser Tyr His 

Val Pro Phe Glu 
120 

Leu Gly Leu Lys 
135 

Tyr Gly Cys Phe 
150 

Asp Ala Ala Trp 
165 

Glu Leu Val Gly 

Ala Pro Ala Glu 
200 

His Ala Ser Ser 
215 

Gly Asn Ala Ala 
230 

Lys Tyr Gly Ser 
245 

His Ser Ala Asp 

Gin Glu Arg Leu 
280 

Arg Ser His Leu 
295 

Cys Ser Gly Val 
310 

Glu Val Trp Thr 
325 

Lys Arg Arg Leu 

Phe Trp Glu Ser 
360 

Gly Lys Lys Tyr 
375 



Ser He Arg Ala 
75 

Val Ser Val Glu 
90 

Gly Trp Gly Phe 
105 

Lys Asp Leu Tyr 

Glu Val Ala Arg 
140 

Asp Gin Glu Ala 
155 

Tyr Leu Glu Pro 
170 

Pro Pro Gly Lys 
185 

Asn Phe Val Gly 

Leu Arg Ser Gly 
220 

Leu Pro Pro Glu 
235 

Gly Met Gly Val 
250 

Leu Val Pro Glu 
265 

Asn Lys Glu He 

Asn Cys Thr Val 
300 

Phe Lys Val Trp 
315 

Tyr Ala He Val 
330 

Ala Asp Ser Val 
345 

Asp Asp Asn Asp 

Gin Ser Arg Asp 
380 



Phe Leu Asn Val 
80 

Ala Gly Asn Ala 
95 

Gly Ser Asn Gly 
110 

Gly Glu Ser Leu 
125 

Val Glu Ser Phe 

Pro Pro Leu Met 
160 

Met Phe Lys His 
175 

Val Val He Lys 
190 

Asp Ala Tyr His 
205 

Glu Ser He Phe 

Gly Ala Gly Leu 
240 

Leu Trp Asp Gly 
255 

Leu Met Ala Phe 

270 . 
Gly Asp Val Arg 
285 

Phe Pro Asn Asn 

Asn Pro He Asp 
320 

Glu Lys Asp Met 
335 

Gin Arg Thr Val 
350 

Asn Met Glu Thr 
365 

Ser Asp Leu Leu 



Ser Asn Leu Gly Phe Gly Glu Asp 
385 \ 390 

Gly Val Val Gly Lys Ser Ala He 
405 

5 Tyr Arg Ala Tyr Gin Ala His Val 
420 

Glu His Ala Ser Ser Thr Trp His 
435 440 

Arg 

10 



4 

Val Tyr Gly Asp Ala Val Tyr Pro 
395 400 
Gly Glu Thr Ser Tyr Arg Gly Phe 

410 415 
Ser Ser Ser Asn Trp Ala Glu Phe 
425 430 
Thr Glu Leu Thr Lys Thr Thr Asp 
445 



<210> 3 
<211> 9841 
<212> DNA 
15 <213> Artificial Sequence 



<220> 

<223> A modified DNA sequence 



20 <400> 3 



gaattcatca 


ggaagacatt 


caaatgaacg 


taaacaataa gggcagcgtc 


tgtatttgcg 


60 


gcagcgaaat 


gctccctaaa 


ttcctcattt 


accccatctg aggattgctt 


tatgacagta 


120 


aagtggattg 


aagcagtcgc 


tctttctgac 


atccttgaag gtgacgtcct 


cggcgtgact 


180 


gtcgagggca 


aggagctggc 


gctgtatgaa 


gttgaaggcg aaatctacgc 


taccgacaac 


240 


25ctgtgcacgc 


atggttccgc 


ccgcatgagt 


gatggttatc tcgagggtag 


agaaatcgaa 


300 


tgccccttgc 


atcaaggtcg 


gtttgacgtt 


tgcacaggca aagccctgtg 


cgcacccgtg 


360 


acacagaaca 


tcaaaacata 


tccagtcaag 


attgagaacc tgcgcgtaat 


gattgatttg 


420 


agctaagaat 


tttaacagga 


ggcaccccgg 


gccctagagc gtaatcaccc 


ccattccatc 


480 


ttttttaggt 


gaaaacatga 


attacaataa 


taaaatcttg gtaagtgaat 


ctggtctgag 


540 


30ccaaaagcac 


ctgattcatg 


gcgatgaaga 


acttttccaa catgaactga 


aaaccatttt 


600 


tgcgcggaac 


tggctttttc 


tcactcatga 


tagcctgatt cctgcccccg 


gcgactatgt 


660 


taccgcaaaa 


atggggattg 


acgaggtcat 


cgtctcccgg cagaacgacg 


gttcgattcg 


720 


tgcttttctg 


aacgtttgcc 


ggcatcgtgg 


caagacgctg gtgagcgtgg 


aagccggcaa 


780 


tgccaaaggt 


tttgtttgca 


gctatcacgg 


ctggggcttc ggctccaacg 


gtgaactgca 


840 


35gagcgttcca 


tttgaaaaag 


atctgtacgg 


cgagtcgctc aataaaaaat 


gtctggggtt 


900 


gaaagaagtc 


gctcgcgtgg 


agagcttcca 


tggcttcatc tacggttgct 


tcgaccagga 


960 


ggcccctcct 


cttatggact 


atctgggtga 


cgctgcttgg tacctggaac 


ctatgttcaa 


1020 


gcattccggc 


ggtttagaac 


tggtcggtcc 


tccaggcaag gttgtgatca 


aggccaactg 


1080 


gaaggcaccc 


gcggaaaact 


ttgtgggaga 


tgcataccac gtgggttgga 


cgcacgcgtc 


1140 


40ttcgcttcgc 


tcgggggagt 


ctatcttctc 


gtcgctcgct ggcaatgcgg 


cgctaccacc 


1200 
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tgaaggcgca ggcttgcaaa tgacctccaa atacggcagc ggcatgggtg tgttgtggga 1260 

cggatattca ggtgtgcata gcgcagactt ggttccggaa ttgatggcat tcggaggcgc 1320 

aaagcaggaa aggctgaaca aagaaattgg cgatgttcgc gctcggattt atcgcagcca 1380 

cctcaactgc accgttttcc cgaacaacag catgctgacc tgctcgggtg ttttcaaagt 1440 

Satggaacccg atcgacgcaa acaccaccga ggtctggacc tacgccattg tcgaaaaaga 1500 

catgcctgag gatctcaagc gccgcttggc cgactctgtt cagcgaacgg tcgggcctgc 1560 

tggcttctgg gaaagcgacg acaatgacaa tatggaaaca gcttcgcaaa acggcaagaa 1620 

atatcaatca agagatagtg atctgctttc aaaccttggt ttcggtgagg acgtatacgg 1680 

cgacgcggtc tatccaggcg tcgtcggcaa atcggcgatc ggcgagacca gttatcgtgg 174 0 

lOtttctaccgg gcttaccagg cacacgtcag cagctccaac tgggctgagt tcgagcatgc 1800 

ctctagtact tggcatactg aacttacgaa gactactgat cgctaacaga cgagtcgacc 1860 

atgatgatca atattcaaga agacaagctg gtttccgccc acgacgccga agagattctt 1920 

cgtttcttca attgccacga ctctgctttg caacaagaag ccactacgct gctgacccag 1980 

gaagcgcatt tgttggacat tcaggcttac cgtgcttggt tagagcactg cgtggggtca 2040 

15gaggtgcaat atcaggtcat ttcacgcgaa ctgcgcgcag cttcagagcg tcgttataag 2100 

ctcaatgaag ccatgaacgt ttacaacgaa aattttcagc aactgaaagt tcgagttgag 2160 

catcaactgg atccgcaaaa ctggggcaac agcccgaagc tgcgctttac tcgctttatc 2220 

accaacgtcc aggccgcaat ggacgtaaat gacaaagagc tacttcacat ccgctccaac 2280 

gtcattctgc accgggcacg acgtggcaat caggtcgatg tcttctacgc cgcccgggaa 2340 

20gataaatgga aacgtggcga aggtggagta cgaaaattgg tccagcgatt cgtcgattac 2400 

ccagagcgca tacttcagac gcacaatctg atggtctttc tgtgattcag tgaccatttt 2460 

tacaaatggt cactgcaacc gcggtcacca ttaatcaaag ggaatgtacg tgtatgggca 2520 

atcaacaagt cgtttcgata accggtgcag gctcaggaat cggtctcgaa ctggttcggt 2580 

cctttaagtc ggccggttat tacgtatccg ctctcgtacg aaacgaggag caagaggcgc 2640 

25ttctttgcaa agagttcaag gacgcactcg agattgtagt gggcgatgtc cgggaccacg 2700 

caacaaatga gaagctgata aagcaaacaa tcgatagatt cggtcatctt gattgtttta 2760 

ttgcaaatgc cggtatctgg gattacatgc tgagcatcga agagccttgg gagaaaatat 2820 

cgagcagttt tgacgaaata ttcgacatta atgtcaagag ctatttcagt ggcatcagtg 2880 

ccgccctgcc ggaactgaaa aagactaacg gatcagtggt gatgaccgct tcggtgtcgt 2940 

30cccatgcggt cggtggtggt ggttcttgct acatcgccag caagcatgcg gtgctcggta 3000 

tggttaaggc tttggcctac gaattggccc ccgaagttcg cgtgaacgct gtttcgccgg 3060 

ggggcaccgt gacgtctctg tgcggtcccg cgagcgccgg tttcgacaaa atgcacatga 3120 

aagacatgcc cggcatcgac gatatgatca aaggtctcac gcctcttggg tttgcagcca 3180 

agcccgaaga cgtggtggca ccctatttgt tgctggcttc gcgaaagcaa ggaaaattca 3240 

35tcaccggcac cgtgattagc attgatggcg gtatggcgct cggtcgcaag tgagcttgta 3300 

gccgatcaga agttatagac acatttcagg tgacgcccca tgaagacaaa actgtttatc 33 60 

aataacgcct ggatcgattc tagtgaccag cagaccttcg agcgcataca ccccgtcagc 3420 

agcgatgtgg tgactgagag cgcaaacgcc acagtgacgg acgcgataaa ggcggcgcaa 3480 

gcggccgagg aggcgttcaa gacctggaag gccgttggac cttcagagcg tcgccgcctt 3540 

40ctcctaaagg tcgccgatgt catggaaagt aaaacaccca agttcatcga agtgatggcc 3600 
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atggaggtgg 


gagcttccgc 


cctttgggcc 


ggat tcaacg 


tccatgcgtc 


tgccaatgtg 


3660 


ttccgagagg 


ctgccfccgct 


ggctacccaa 


attcagggtg 


aaaccatccc 


aacggacaaa 


3720 


gccgaaacgc 


tctcaatgac 


actacgtcag 


ccggtcggcc 


cgatcctaag 


catcgttcca 


3780 


tggaacggca 


ccgcagtgct 


tgcggcacga 


gccatcgctt 


atccgctggt 


ctgtggcaac 


3840 


Sactgtggtgt 


tcaaaggctc 


tgaatttagt 


cccgcgacgc 


atgccctgat 


cacccagtgc 


3900 


gtgcaggaag 


ccgggctgcc 


cgctggcgtg 


ctcaattacc 


tcaactcttc 


gcctgaccgt 


3960 


tcgcccgaga 


tcgctgacgc 


actgatctct 


gccaaggaga 


tccgccgcat caacttcacg 


4020 


ggttccaccc 


gcgtgggcag 


cattatcgcg cagaaagccg 


cgcaacacct 


caagcgctgc 


4080 


ctgctggagc 


tcggcggcaa 


gtccccgctt 


attgttctgg 


atgatgcaga 


catcgatgcg 


4 140 


lOgcggtcaagg 


cagcggtgtt 


cggtagcttc 


ctgttccaag 


gtcagatctg 


catgtccact 




gagcgcttga 


tcgttgatga 


gaagatagcc 


gacgaatttg 


tcgcaaaatt 


tgtcgaaaaa 


4760 
t*ou 


actaagcgct 


tgagcgcagg 


cgacccgtgc 


gtaactggcg 


actgcatcat 


cggcccgatg 


a i 1 n 

fk J Z \J 


gtctcgccaa 


attcgggtga 


gcggatcaat 


ggtttgttca 


aagacgcgat 


cgacaaaggg 


A fl ft 
Tt J 0 U 


gcaaaagttg 


tttgcggcgg 


cttggcccaa 


ggtgcgctca 


tgccggccac gatcctggat 


A A Ad 


15cacgtcaaat 


ctgacatgcg 


gatttacgat 


gaggagacct 


ttggtcccat 


caccgtggta 


A C. Aft 


atccgttgta 


aaggcgaagc 


agaggccgtc 


cgcattgcca 


acgacagcgt 


ctatggcctg 


4560 


tcgtcgggcg 


tatttggccg 


cgacatcaac 


cgcgctctac 


gcgtgggtat 


gtccatcgaa 


4620 


tatggttctg 


tacacatcaa 


cggttcgacc 


gtccagaacg 


aggcgcaggc 


tccttacgga 


4680 


ggcaccaaga 


acaccggcta 


cgggcgcttc 


gacggccgtg 


ctgtaatcga 


cgagttcaca 


4740 


20gagatcaagt 


ggctgaccat 


cgaacctttc gagcagcaat 


atcccttctg 


ataagcacta 


4800 


actcccagga 


atcaaactat 


gagtaagcaa 


gctgcagtta 


tcgagctcgg 


atacatgggt 


4860 


atctcggtca 


aggaccctga 


tgcgtggaaa 


tcatttgcca 


cggatatgct 


aggtctgcaa 


4920 


gttcttgatg 


agggtgagaa 


ggaccgtttc 


tatctgcgga 


tggattactg 


gcatcatcgg 




atcgtagtcc 


atcacaacgg 


acaggacgac 


ttggagtacc 


taggctggcg 


tgtagccggc 


504 0 


25aagccggagt 


tcgaagctct 


gggtcaaaag 


cttattgatg 


ccggttacaa gatccgcatc 


jIUv 


tgcgacaaag 


ttgaggctca 


ggagcgtatg gtgttgggtc 


tgatgaagac 


agaagatccg 


ci en 


ggcggcaacc 


cgaccgagat 


attctggggc 


ccccggatcg 


acatgagcaa 


cccgttccat 




cccggtcgcc 


ccctgcacgg 


aaagtttgtg 


accggtgacc 


aaggcttggg 


ccattgcatc 




gttcgccaaa 


ccgacgtcgc 


agaagctcat 


aagttttata 


gcctgctggg 


cttccgtggg 


CIA ft 


30gacgtcgaat 


accggattcc 


gttgcccaac 


ggcatgactg 


ccgaactgtc gttcatgcat 


CA ft ft 


tgcaacgccc 


gtgatcactc 


cattgctttt 


ggtgccatgc 


ccgctgccaa 


acgactcaat 


C4 Cft 


cacttgatgc 


ttgagtacac 


ccatatggaa gacttgggat 


acacgcacca 


acagtttgta 


ccon 


aagaacgaaa 


ttgacattgc 


cttgcagctt 


ggcattcacg 


ccaacgacaa 


ggcgttgacg 


3 J o w 


ttctatggtg 


caacgccttc 


gggctggctc 


attgagcccg 


gctggcgagg 


tgccacggcc 


C ft 


35atagatgaag 


cggagtatta 


cgtcggcgac 


atcttcggcc 


atggcgtgga ggccactgga 


5700 


tatggcctgg 


atgtaaaact 


gagctaaaga 


tgcgcgctcg 


ttgggcgagg 


ctctagtcca 


5760 


gcatcttcat 


acgcaaccaa 


ccttgcaggg 


cgatgagatc 


aaaggacgtt 


aaagcgaagg 


5820 


ggaagtggtt 


cgggccatgc 


gcataccgat 


ccatgacatt 


tgtttcatag 


tatataggta 


5880 


gataggtgaa 


tcaagcgctt 


agtcaactag 


tggacacatc 


tgttccatga ggctatctac 


5940 


40tatctattca 


aaacaagaat 


aataaatagg 


atgaaaataa 


taatgataaa aagaacgatt 


6000 
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tgtcttgtgt 


atcctctatt 


ctgtttggca 


agccccacat 


gggccgaaga 


gtcgccttgg 


6060 


acgtaccgta 


ttggtatgac 


taatgtagct 


ttcgatgcta 


gcgcaaaagt 


atacttaaat 


6120 


ggtcagcggg 


tgccaggagg 


aagcgctgat 


gcgagcgata 


acaacgcgct 


tacattcgac 


C T O A 


ttcggctacg 


ccatcaacga 


ccagtggaat 


gtacgtgcga 


ttgtcggcat 


tccgcctaca 


fiO A ft 


5actaaagtga 


cgggcgcagg 


cacacttcct 


ggtatccagc 


cggggaaaau 




6300 


ccaacagtat 


taacgttgaa 


ctataacctc 


cccgctttgg 


ytcccgcLcg 


r-i /-* r~> f- t~* Z± f~* t~ a 


6360 


ggtgcgggag 


tcaattacac 


gcggattttt 


gaaagtcggg 


-v /-* ft *- a a f- r% |- 


ctcictct Ley L 


6420 


gatgccgacc 


acgcttggtc ccccgcgcta 


catgttggtg 


a a fr* ^ /i a 

CCyatattga 




6480 


ggttggttcg 


ttagcattga 


cacccggaay 


ttatacctga 


aaaccgacyt* 


ctuwciyyy tav* 


6540 


lOttggggccac 


aggaggctaa 


agcacgggta 


actcttgacc 


j-m A- «^ A 4- *^ /"l 

CclCCaCtaaC 


l. L.v*.yctu^y v-y 


6600 


atcggacgcc 


aattctgatg 


attctgttta 


aagttcttta 


tctatctaac 


cgcaaagggt 


ODOw 


gtttccatgt 


cgaataaaat 


tatgaaaacg 


tcgcgtctta 


ccgccgaaga 


4— V- jr-i 

LaUCddCyyC 


6720 


gcctggacta 


taatgcccac 


accctcgacg 


cctgatgctt 


ctgattggcg 


cagcactgcc 


an 
0 / ov 


accgtggact 


tagaagagac 


tgcccgcata 


gttgaagagc 


tgattgcagc 


tggtgtcaac 


CQ A C\ 

bo4U 


15ggtattctaa 


gtatgggtac 


ttttggtgag 


tgcgccacgt 


tgacctggga 


tgaaaaacgt 




gattatgtct 


cgacgattgt 


cgagaccatt 


cgtggtcgcg 


tgccttattt 


ctgtggcacg 


6960 


acagccttaa 


atacccgaga 


agtcatccgc 


cagacccgag 


agcttatcga 


tattggegee 


7020 


aacggcacta 


tgctcggggt 


gccgatgtgg 


gtgaagatgg 


acctgcctac 


agcggttcag 


7080 


ttctatcgtg 


atgttgcaga 


tgcggtacca 


gaggctgcca 


ttgcgattta 


cgccaacccc 


714 0 


20gaagcattca 


agttcgactt 


ccctcgccca 


ttctgggcag 


agatgtccaa 


aattccgcag 


7200 


gtagtgactg 


cgaagtatct 


aggcatcgga 


atgcttgact 


tggacctgag 


actggcaccc 


7260 


aacatccgct 


tccttcccca 


cgaagatgac 


tattacgcgg 


ccgcacgcat 


caatcccgag 


/ JzU 


cgcataaccg 


cgttctggtc 


aagcggggcc 


atgtgcggcc 


cggctaccgc 


catcatgttg 


t *a 0 n 


cgtgacgaag 


tggtgcgggc 


caagagcacc 


ggtgactggg 


ccaaggccaa 


agccatctcc 


7440 


25gatgatatgc 


gtgcagccga ctcgacattg 


tttccgcgtg gcgacttttc 


ggagttctcg 


7500 


aagtataata 


tcgggcttga 


aaaggcacgg 


atggatgcgg 


ctggttggct 


caaggctggg 


/DOO 


ccctgccgtc 


cgccctacaa 


ccttgttcca 


gaagactacc 


tcgctggtgc 


acagaaatca 




ggcaaggctt 


gggccgcgct 


gcacgctaaa 


tacagtaatg 


aattgaagta 


gttcacctcc 


7680 


gcagacctga 


gtgacagggt 


ggcgcagacg 


ctgagggtgc 


aggaattaag 


tgagctaaag 


7740 


30cacatttctt 


gcgccaggca 


ttgccagatc 


agcaaagttt 


gctgatctgg 


cagtttcaaa 


7800 


aatttgggcg 


aaagctgata 


tcaggaatac 


gggataaagg 


cagtgcacca 


taacgacggg 


7860 


gcgtgccatt 


cgtgatgaac gattttgcta 


ttgtgccgac 


ttctgttctt ggagtgtttg 


7 920 


attgtgattg 


tcgattttta 


tttcgatttt 


ttgagtccgt 


tctcttactt 


ggccaaccat 


/SOU 


cgtttgtcaa 


agcttgcgca 


agactatggc 


ttttccattc 


gttattacgc 


aatcgatttg 


8040 


35gcgcgagtta 


aaatagccat 


cggaaacgtt 


ggtccatcta 


atcgcgacct 


gatagtcaag 


8100 


ctggactatt 


tgaaagtaga 


tttgcaacgg 


tgggccgagc 


tttacgaaat 


accgttggta 


8160 


ttcccagcta 


actacaacag 


ccgacggatg 


aatactgggc 


tttattactc 


gggagccatg 


8220 


gcacagactg 


gtgcctatgt gaatgtagta 


tttaatgcgg 


tttggggaga 


tggcatagct 


8280 


ccagatttgg 


aaagcttgcc 


tgctctggta 


tctgaaaaac 


taggctggga 


tcgtagcgcc 


8340 


40ttcgaggact 


ttatcagcag cgatgccgca 


acagagaggt 


atgacgagca gacacatgcc 


8400 



8 



gcgatcgaac 


gcaaagtgtt 


cggtgtgcca 




^y y y y ** *- y 61 


a a t~ at* a at - a a 


8460 


ggaaacgacc 


gtctatttat 


gctcgagaac 


gcagtgggag 


y uy i-y t-t- tyi 


aaatggagaa 


8520 


tagtcgctac 


ggagcgcttg 


tgccggctaa 


atgccgatat 


aayuyyuuyct 


rrhnflt" cci t* t" 


8580 


atttgctcga 


tacagcgctt 


tcaaaatcag 


cggc tactga 


ciy U c*y dUaa 




8640 


Sctacttcagg 


catcctgtgc 


gacacaaagt 


tttacctgta 




LaLttty ciy i- 


8700 


ttggaatggt 


agctgactcg 


ctatgcgacc 


agcgatagcc 


LaaCaa^aCa 


U^^uLLa^ Ly 


8760 


gtaacggtgg 


ggtgtgaagc 


tcctgcaaca 


atgtagcccc 




t*At* ttactac 


8820 


gaggtgaagc 


acagatgctc ggagccgtac 


cggcttgtgg 


cy (- tayyc uy 


nra ant - a taa 


8880 


gcaacgtaag 


tgggggttgg 


ggcgcaatgg 


gaaccaaaaa 


ccaacgcaag 


rrttaccaac 


8940 


lOgtcgttcggt 


gccttcctcc 


catgcctccg 


cctcgataaa 


gcagctgcgc 


CI d \~ V_# ^ V* I. 


9000 


cctggctgat 


ctcggttagt 


aggtcatgca 


aggtcttgtc 


cagcgcctcg 


t eac tec era fc 


9060 


acggaatggt 


cagctcgtaa 


tggccggtct 


ccgaccgctt 


catgccgtag 


y y ^ UL^ayy^ 


9120 


agtagcgctc 


gatgttctcc 


gtggcccgct 


tccgaccgcg 


catgaacttg 




9180 


ccaccgccag 


gcgcagggtg 


acggtggcca 


cccgctcgac 


ggttgactct 


y<-t-yy uy oty 


9240 


IScgatattgcg 


cttttgacct 


cgcgccaggg 


cgctcttctg gtacgtcccg 


atctcgacgc 


9300 


cacggtggcg 


taggtagctg 


tacagggtgc 


tcttggagat 


gtgcaacttc 


tcgccgatgg 


9360 


cgctgacgct 


caggcggccc 


tcgcggtaca 


gggtctccgc 


cgccatggcg 


gtggcctcgg 


9420 


ccttggctgg 


caggcccttg 


ggacggcgac 


cgatccggcc 


tcgagtccgt 


gccgccgaca 


QA q n 
y*± o u 


ggcccgcctg 


agtccgctcg 


cggatcagct 


cgcgctcgaa 


ctgaacaggt 


tgaacaccag 


9540 


20gcgatcttgg 


gcgtgggtgc 


tgtcaatggg 


gtcgttcagg 


ctctgcaagc 


cgactttgcg 


9600 


tgcagccagc 


tagccgacca 


actcaaccag 


gtgcttgagc 


gagcgaccga ggcgatccag 


9660 


cttccagatc 


accacggcat 


cgcccgctcg 


aacatgggct 


agcaacttgt 


ccaactccgg 


9720 


ccgcgcgctt 


tttgcgccgc 


tggcgatgtc 


ttgatagatg 


cgttcgcacc 


cggcctgttt 


9780 


cagggcatcg 


acctggaggt 


cggcgttcta 


atcccgagtg 


ctcacccgcg 


tataaccgat 


9840 


25c 
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<210> 4 
30 <211> 2515 

<212> DNA 

<213> Artificial Sequence 
<220> 

35 <223> A modified DNA sequence 

<400> 4 

gaattcatca ggaagacatt caaatgaacg taaacaataa gggcagcgtc tgtatttgcg 
gcagcgaaat gctccctaaa ttcctcattt accccatctg aggattgctt tatgacagta 
40 aagtggattg aagcagtcgc tctttctgac atccttgaag gtgacgtcct cggcgtgact 





gtcgagggca 


aggagctggc 


gctgtatgaa 




ctgtgcacgc 


atggttccgc 


ccgcatgagt 




tgccccttgc 


atcaaggtcg 


gtttgacgtt 




acacagaaca 


tcaaaacata 


tccagtcaag 


5 


agctaagaat 


tttaacagga 


ggcaccccgg 




ttttttaggt 


gaaaacatga 


attacaataa 




ccaaaagcac 


ctgattcatg 


gcgatgaaga 




tgcgcggaac 


tggctttttc 


tcactcatga 




taccgcaaaa 


atggggattg 


acgaggtcat 


10 


tgcttttctg 


aacgtttgcc 


ggcatcgtgg 




tgccaaaggt 


tttgtttgca 


gctatcacgg 




gagcgttcca 


tttgaaaaag 


atctgtacgg 




gaaagaagtc 


gctcgcgtgg 


agagcttcca 




ggcccctcct 


cttatggact 


atctgggtga 


15 


gcattccggc 


ggtttagaac 


tggtcggtcc 




gaaggcaccc 


gcggaaaact 


ttgtgggaga 




ttcgcttcgc 


tcgggggagt 


ctatcttctc 




tgaaggcgca 


ggcttgcaaa 


tgacctccaa 




cggatattca 


ggtgtgcata 


gcgcagactt 


20 


aaagcaggaa 


aggctgaaca 


aagaaattgg 




cctcaactgc 


accgttttcc 


cgaacaacag 




atggaacccg 


atcgacgcaa 


acaccaccga 




catgcctgag 


gatctcaagc 


gccgcttggc 




tggcttctgg 


gaaagcgacg 


acaatgacaa 


25 


atatcaatca 


agagatagtg 


atctgctttc 




cgacgcggtc 


tatccaggcg 


tcgtcggcaa 




tttctaccgg 


gcttaccagg 


cacacgtcag 




ctctagtact 


tggcatactg 


aacttacgaa 




atgatgatca 


atattcaaga 


agacaagctg 


30 


cgtttcttca 


attgccacga 


ctctgctttg 




gaagcgcatt 


tgttggacat 


tcaggcttac 




gaggtgcaat 


atcaggtcat 


ttcacgcgaa 




ctcaatgaag 


ccatgaacgt 


ttacaacgaa 




catcaactgg 


atccgcaaaa 


ctggggcaac 


35 


accaacgtcc 


aggccgcaat 


ggacgtaaat 




gtcattctgc 


accgggcacg 


acgtggcaat 




gataaatgga 


aacgtggcga 


aggtggagta 




ccagagcgca 


tacttcagac 


gcacaatctg 




tacaaatggt 


cactgcaacc 


gcggtcacca 



40 



9 



gttgaaggcg 


aaatctacgc 


taccgacaac 


240 


gatggttatc 


tcgagggtag 


agaaatcgaa 


300 


tgcacaggca 


aagccctgtg 


cgcacccgtg 


360 


attgagaacc 


tgcgcgtaat 


gattgatttg 


420 


gccctagagc 


gtaatcaccc 


ccattccatc 


480 


taaaatcttg 


gtaagtgaat 


ctggtctgag 


540 


acttttccaa 


catgaactga 


aaaccatttt 


600 


tagcctgatt 


cctgcccccg 


gcgactatgt 


660 


cgtctcccgg 


cagaacgacg 


gttcgattcg 


720 


caagacgctg 


gtgagcgtgg 


aagccggcaa 


780 


ctggggcttc 


ggctccaacg 


gtgaactgca 


840 


cgagtcgctc 


aataaaaaat 


gtctggggtt 


900 


tggcttcatc 


tacggttgct 


tcgaccagga 


960 


cgctgcttgg 


tacctggaac 


ctatgttcaa 


1020 


tccaggcaag 


gttgtgatca 


aggccaactg 


1080 


tgcataccac 


gtgggttgga 


cgcacgcgtc 


1140 


gtcgctcgct 


ggcaatgcgg 


cgctaccacc 


1200 


atacggcagc 


ggcatgggtg 


tgttgtggga 


1260 


ggttccggaa 


ttgatggcat 


tcggaggcgc 


1320 


cgatgttcgc 


gctcggattt 


atcgcagcca 


1380 


catgctgacc 


tgctcgggtg 


ttttcaaagt 


1440 


ggtctggacc 


tacgccattg 


tcgaaaaaga 


1500 


cgactctgtt 


cagcgaacgg 


tcgggcctgc 


1560 


tatggaaaca 


gcttcgcaaa 


acggcaagaa 


1620 


aaaccttggt 


ttcggtgagg 


acgtatacgg 


1680 


atcggcgatc 


ggcgagacca 


gttatcgtgg 


1740 


cagctccaac 


tgggctgagt 


tcgagcatgc 


1800 


gactactgat 


cgctaacaga 


cgagtcgacc 


1860 


gtttccgccc 


acgacgccga 


agagattctt 


1920 


caacaagaag 


ccactacgct 


gctgacccag 


1980 


cgtgcttggt 


tagagcactg 


cgtggggtca 


2040 


ctgcgcgcag 


cttcagagcg 


tcgttataag 


2100 


aattttcagc 


aactgaaagt 


tcgagttgag 


2160 


agcccgaagc 


tgcgctttac 


tcgctttatc 


2220 


gacaaagagc 


tacttcacat 


ccgctccaac 


2280 


caggtcgatg 


tcttctacgc 


cgcccgggaa 


2340 


cgaaaattgg 


tccagcgatt 


cgtcgattac 


2400 


atggtctttc 


tgtgattcag 


tgaccatttt 


2460 


ttaatcaaag 


ggaatgtacg 


tgtat 


2515 
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<210> 5 

<211> 9706. 

<212> DNA 

<213> Artificial Sequence 

5 

<220> 

<223> A modified DNA sequence 



<400> 5 



lOgctgatttcg 


accaattatt 


ccacgccgtc 


tgcgccagca 


aggccgtaat 


cagttcgtaa 


60 


aggggagccc 


gcagtaacag 


cgaaccgacg 


ccggtagctg 


cgcgcgtgaa 


tcctgagatc 


ion 


acaaattctt 


ggcgattatc 


gcatttcaat 


ttcacccaag 


cgccgttcct 


tgcgctgctg 


1 Q A 

180 


tagcggcggc 


catcacgctt 


ctccacgatt 


ccctccagct 


gcatccggca 


agcgttggcc 


240 


agcagctgca 


ctgggtaagc 


ttccagggta 


tcgctgaagc 


gtacgcgctc 


caatgggcag 


300 


15tgctcgagga 


gcgcctcaag 


tacctggcgg 


ctcagttcaa 


cgccctctcc 


ccgcagatct 


360 


gcaccgtcga 


ggtagagcag 


gtcgaacgcc 


acgtacacaa 


atttgtcggt 


acaccgtgcg 


420 


gcgaacgctg 


actgcagggc 


ttgaaatgct 


ggtcgaccgt 


cgtcatactg 


gaatacgacc 


480 


tcaccgtcga 


gccaggctga 


atggacggct 


aggcgtccca 


gctatttggc 


caggagtggc 


540 


atatgatcga 


tctagtcaaa 


cccgtttttg 


gtgaaaagct 


gcacctggtc 


accatcgatc 


600 


20cgtgctaaca 


gacgatacca 


gtcgtacttg 


atctcgtatc 


gccatgctcc 


tgccgacgac 


660 


agtgtcagta 


gagtaaccag 


ttgtggcgcg 


atccaacacg 


ggtttggtgc 


cttgggcggt 


720 


ttcactatcg 


ccacctcgct 


ataggaacat 


tccttcaggg 


tggagtgcgt 


aattttctga 


780 


aaggggagcc 


aggttatgag 


tattcacatt 


ggtgataaac 


aacatcactt 


atgcgttatt 


840 


gacatataac 


gtcgtattca 


cgattattta 


ccatataagt 


cttataataa 


cgaagccata 


900 


25ttatggaact 


cctcatacaa 


ccgaaaaatc 


gcataattcc 


cttcagtgcc 


ggtgccaacc 


960 


ttctggaagt 


gcttcgcgag 


aacggtgtag 


ctatttccta 


cagttgcttg 


tctgggcgtt 


1020 


gcggaacctg 


tcgctgccgg 


gttatagatg 


gcagtgtcat 


tgattctggg 


gcggaaaatg 


1080 


ggcaatcaaa 


cctcaccgac 


aagcagtatg 


tgctcgcctg 


tcagtcagta 


ctcactggca 


1140 


attgcgctat 


cgaagtccca 


gaagccgacg 


aaattgtcac 


tcacccggcg 


cgaatcatca 


1200 


30agggcacagt 


ggtcgcagtc 


gagtcgccca 


ctcacgatat 


ccgtcgctta 


cgcgtacgcc 


1260 


tctccaagcc 


cttcgagttc 


tcacccggac 


agtacgcgac 


actgcagttc 


agccctgagc 


1320 


atgcgcgtcc 


gtattcaatg 


gcaggtttgc 


cagatgacca 


agaaatggag 


ttccacatac 


1380 


gcaaggtgcc 


gggtgggcgc 


gtcacggagt 


atgttttcga 


acacgtccgc 


gaaggtacaa 


1440 


gcatcaagtt 


gagcgggcct 


cttggtacgg 


cttatctacg 


tcagaagcac 


accggaccga 


1500 


35tgctgtgtgt 


aggtggcggg 


accggactcg 


caccggtgct 


gtcgattgtt 


c g c gg c 9 c g c 


1560 


tgaagtcggg 


tatgacgaac 


cccatcctcc 


tttatttcgg 


ggtgcgcagt 


cagcaagacc 


1620 


tctacgacgc 


agagcgattg 


cacaaactcg 


ccgctgacca 


ccctcaactg 


accgtacaca 


1680 


cggtgattgc 


aacgggcccg 


attaatgagg 


gtcagcgagc 


cggcctaatt 


accgatgtga 


1740 


tcgaaaaaga 


catcctttcg 


ctggctgggt 


ggagggccta 


cctgtgcggc 


gcaccagcga 


1800 


40tggttgaagc 


gttgtgcacc 


gtcaccaagc 


atcttggaat 


atcacccgaa 


catatttatg 


1860 



II 



ccgatgcctt 


ctatcccggt 


gggatctgaa 


tagttcccgg 


ccatgcacct 


ctgtccatcg 


1920 


agaattcatc 


aggaagacat 


tcaaatgaac 


gtaaacaata 


agggcagcgt 


ctgtatttgc 


1980 


ggcagcgaaa 


tgctccctaa 


attcctcatt 


taccccatct 


gaggattgct 


ttatgacagt 


2040 


aaagtggatt 


gaagcagtcg 


ctctttctga 


catccttgaa 


ggtgacgtcc 


tcggcgtgac 


2100 


5tgtcgagggc 


aaggagctgg 


cgctgtatga 


agttgaaggc 


gaaatctacg 


ctaccgacaa 


2160 


cctgtgcacg 


catggttccg 


cccgcatgag 


tgatggttat 


ctcgagggta 


gagaaatcga 


z z zU 


atgccccttg 


catcaaggtc 


ggtttgacgt 


ttgcacaggc 


aaagccctgt 


gcgcacccgt 


o *3 o n 
zz a U 


gacacagaac 


atcaaaacat 


atccagtcaa 


gattgagaac 


ctgcgcgtaa 


tgaccgatx c 


n i a n 
z 3% U 


gagctaagaa 


ttttaacagg 


aggcaccccg 


ggccctagag 


cgtaatcacc 


cccattccat 


o a f\ n 
z4 UU 


lOcttttttagg 


tgaaaacatg 


aattacaata 


ataaaatctt 


ggtaagtgaa 


tctggtctga 


2460 


gccaaaagca 


cctgattcat 


ggcgatgaag 


aacttttcca 


acatgaactg 


aaaaccattt 


2520 


ttgcgcggaa 


ctggcttttt 


ctcactcatg 


atagcctgat 


tcctgccccc 


ggcgactatg 


2580 


ttaccgcaaa 


aatggggatt 


gacgaggtca 


tcgtctcccg 


gcagaacgac 


ggttcgattc 


2640 


gtgcttttct 


gaacgtttgc 


cggcatcgtg 


gcaagacgct 


ggtgagcgtg 


gaagccggca 


2700 


15atgccaaagg 


ttttgtttgc 


agctatcacg 


gctggggctt 


cggctccaac 


ggtgaactgc 


2760 


agagcgttcc 


atttgaaaaa 


gatctgtacg 


gcgagtcgct 


caataaaaaa 


tgtctggggt 


2820 


tgaaagaagt 


cgctcgcgtg 


gagagcttcc 


atggcttcat 


ctacggttgc 


ttcgaccagg 


2880 


aggcccctcc 


tcttatggac 


tatctgggtg 


acgctgcttg 


gtacctggaa 


cctatgttca 


2 94 0 


agcattccgg 


cggtttagaa 


ctggtcggtc 


ctccaggcaa 


ggttgtgatc 


aaggccaact 


3000 


20ggaaggcacc 


cgcggaaaac 


tttgtgggag 


atgcatacca 


cgtgggttgg 


acgcacgcgt 


3060 


cttcgcttcg 


ctcgggggag 


tctatcttct 


gctcgctcgc 


tggcaatgcg 


gcgctaccac 


3120 


ctgaaggcgc 


aggcttgcaa 


atgacctcca 


aatacggcag 


cggcatgggt 


gtgttgtggg 


3180 


acggatattc 


aggtgtgcat 


agcgcagact 


tggttccgga 


attgatggca 


ttcggaggcg 


3240 


caaagcagga 


aaggctgaac 


aaagaaattg 


gcgatgttcg 


cgctcggatt 


tatcgcagcc 


3300 


25acctcaactg 


caccgttttc 


ccgaacaaca 


gcatgctgac 


ctgctcgggt 


gttttcaaag 


3360 


tatggaaccc 


gatcgacgca 


aacaccaccg 


aggtcfcggac 


ctacgccatt 


gtcgaaaaag 


3420 


acatgcctga 


ggatctcaag 


cgccgcttgg 


ccgactctgt 


tcagcgaacg 


gtcgggectg 


3480 


ctggcttctg 


ggaaagcgac 


gacaatgaca 


atatggaaac 


agcttcgcaa 


aacggcaaga 


3540 


aatatcaatc 


aagagatagt 


gatctgcttt 


caaaccttgg 


tttcggtgag 


gacgtatacg 


3600 


30gcgacgcggt 


ctatccaggc 


gtcgtcggca 


aatcggcgat 


cggcgagacc 


agttatcgtg 


3660 


gtttctaccg 


ggcttaccag 


gcacacgtca 


gcagctccaa 


ctgggctgag 


ttcgagcatg 


3720 


cctctagtac 


ttggcatact 


gaacttacga 


agactactga 


tcgctaacag 


acgagtcgac 


3780 


catgatgatc 


aatattcaag 


aagacaagct 


ggtttccgcc 


cacgacgccg 


aagagattct 


3840 


tcgtttcttc 


aattgccacg 


actctgcttt 


gcaacaagaa 


gccactacgc 


tgctgaccca 


3900 


35ggaagcgcat 


ttgttggaca 


ttcaggctta 


ccgtgcttgg 


ttagagcact 


gcgtggggtc 


3960 


agaggtgcaa 


tatcaggtca 


tttcacgcga 


actgcgcgca 


gcttcagagc 


gtcgttataa 


4020 


gctcaatgaa 


gccatgaacg 


tttacaacga 


aaattttcag 


caactgaaag 


ttcgagttga 


4080 


gcatcaactg 


gatccgcaaa 


actggggcaa 


cagcccgaag 


ctgcgcttta 


ctcgctttat 


4140 


caccaacgtc 


caggccgcaa 


tggacgtaaa 


tgacaaagag 


ctacttcaca 


tccgctccaa 


4200 


40cgtcattctg 


caccgggcac 


gacgtggcaa 


tcaggtcgat 


gtcttctacg 


ccgcccggga 


4260 
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agataaatgg 


aaacgtggcg 


aaggtggagt 


acgaaaattg 


gtccagcgat 


tcgtcgatta 


4320 


cccagagcgc 


atactbpaga 


cgcacaatct 


gatggtcttt 


ctgtgattca 


gtgaccattt 


4380 


ttacaaatgg 


tcactgcaac 


cgcggtcacc 


attaatcaaa 


gggaatgtac 


gtgtatgggc 


4440 


aatcaacaag 


tcgtttcgat 


aaccggtgca 


ggctcaggaa 


tcggtctcga 


actggttcgg 


4500 


Stcctttaagt 


cggccggtta 


ttacgtatcc 


gctctcgtac 


gaaacgagga 


gcaagaggcg 


4560 


cttctttgca 


aagagttcaa 


ggacgcactc 


gagattgtag 


tgggcgatgt 


ccgggaccac 


4620 


gcaacaaatg 


agaagctgat 


aaagcaaaca 


atcgatagat 


tcggtcatct 


tgattgtttt 


A £ Q A 


attgcaaatg 


ccggtatctg 


ggattacatg 


ctgagcatcg 


aagagccttg 


ggagaaaata 


474 0 


tcgagcagtt 


ttgacgaaat 


attcgacatt 


aatgtcaaga 


gctatttcag 


tggcatcagt 


4800 


lOgccgccctgc 


cggaactgaa 


aaagactaac 


ggatcagtgg 


tgatgaccgc 


ttcggtgtcg 


4 oou 


tcccatgcgg 


tcggtggtgg 


tggttcttgc 


tacatcgcca 


gcaagcatgc 


ggtgctcggt 


A a o a 


atggttaagg 


ctttggccta 


cgaattggcc 


cccgaagttc 


gcgtgaacgc 


tgtttcgccg 


A qq n 


gggggcaccg 


tgacgtctct 


gtgcggttcc 


gcgagcgccg 


gtttcgacaa 


aatgcacatg 


DU^U 


aaagacatgc 


ccggcatcga 


cgatatgatc 


aaaggtctca 


cgcctcttgg 


gtttgcagcc 


C 1 f\ ft 


ISaagcccgaag 


acgtggtggc 


accctatttg 


ttgctggctt 


cgcgaaagca 


aggaaaattc 


C 1 C ft 


atcaccggca 


ccgtgattag 


cattgatggc 


ggtatggcgc 


tcggtcgcaa 


gtgagcttgt 


C O T A 

5220 


agccgatcag 


aagttataga 


cacatttcag 


gtgacgcccc 


atgaagacaa 


aactgtttat 


5280 


caataacgcc 


tggatcgatt 


ctagtgacca 


gcagaccttc 


gagcgcatac 


accccgtcag 


5340 


cagcgatgtg 


gtgactgaga 


gcgcaaacgc 


cacagtgacg 


gacgcgataa 


aggcggcgca 


5400 


20agcggccgag 


gaggcgttca 


agacctggaa 


ggccgttgga 


ccttcagagc 


gtcgccgcct 


5460 


tctcctaaag 


gtcgccgatg 


tcatggaaag 


taaaacaccc 


aagttcatcg 


aagtgatggc 


5520 


catggaggtg 


ggagcttccg 


ccctttgggc 


cggattcaac 


gtccatgcgt 


ctgccaatgt 


5580 


gttccgagag 


gctgcctcgc 


tggctaccca 


aattcagggt 


gaaaccatcc 


caacggacaa 


5640 


agccgaaacg 


ctctcaatga 


cactacgtca 


gccggtcggc 


ccgatcctaa 


gcatcgttcc 


5700 


25atggaacggc 


accgcagtgc 


ttgcggcacg 


agccatcgct 


tatccgctgg 


tctgtggcaa 


5760 


cactgtggtg 


ttcaaaggct 


ctgaatttag 


tcccgcgacg 


catgccctga 


tcacccagtg 


5820 


cgtgcaggaa 


gccgggctgc 


ccgctggcgt 


gctcaattac 


ctcaactctt 


cgcctgaccg 


5880 


ttcgcccgag 


atcgctgacg 


cactgatctc 


tgccaaggag 


atccgccgca 


tcaacttcac 


5940 


gggttccacc 


cgcgtgggca 


gcattatcgc 


gcagaaagcc 


gcgcaacacc 


tcaagcgctg 


6000 


30cctgctggag 


ctcggcggca 


agtccccgct 


tattgttctg 


gatgatgcag 


acatcgatgc 


6060 


ggcggtcaag 


gcagcggtgt 


tcggtagctt 


cctgttccaa 


ggtcagatct 


gcatgtccac 


6120 


tgagcgcttg 


atcgttgatg 


agaagatagc 


cgacgaattt 


gtcgcaaaat 


ttgtcgaaaa 


6180 


aactaagcgc 


ttgagcgcag 


gcgacccgtg 


cgtaactggc 


gactgcatca 


tcggcccgat 


6240 


ggtctcgcca 


aattcgggtg 


agcggatcaa 


tggtttgttc 


aaagacgcga 


tcgacaaagg 


6300 


35ggcaaaagtt 


gtttgcggcg 


gcttggccca 


aggtgcgctc 


atgccggcca 


cgatcctgga 


6360 


tcacgtcaaa 


tctgacatgc 


ggatttacga 


tgaggagacc 


tttggtccca 


tcaccgtggt 


6420 


aatccgttgt 


aaaggcgaag 


cagaggccgt 


ccgcattgcc 


aacgacagcg 


tctatggcct 


6480 


gtcgtcgggc 


gtatttggcc 


gcgacatcaa 


ccgcgctcta 


cgcgtgggta 


tgtccatcga 


6540 


atatggttct 


gtacacatca 


acggttcgac 


cgtccagaac 


gaggcgcagg 


ctccttacgg 


6600 


40aggcaccaag 


aacaccggct 


acgggcgctt 


cgacggccgt 


gctgtaatcg 


acgagttcac 


6660 
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agagatcaag 


tggctgacca 


tcgaaccttt 


cgagcagcaa 


tatcccttct 


gataagcact 


672 0 


aactcccagg 


aatcaaacta 


tgagtaagca 


agctgcagtt 


atcgagctcg 


gatacatggg 


b /oU 


tatctcggtc 


aaggaccctg 


atgcgtggaa 


atcatttgcc 


acggatatgc 


taggtctgca 


OOH U 


agttcttgat 


gagggtgaga 


aggaccgttt 


ctatctgcgg 


atggattact 


ggcatcatcg 




Sgatcgtagtc 


catcacaacg 


gacaggacga 


cttggagtac 


ctaggctggc 


gcgtagccgg 


6960 


caagccggag 


ttcgaagctc 


tgggtcaaaa 


gcttattgac 


gccggttaca 


agacccgcat 


/ \J £, U 


ctgcgacaaa 


gttgaggctc 


aggagcgtat 


ggtgttgggt 


ctgatgaaga 


cagaagaucc 


/ u o u 


gggcggcaac 


ccgaccgaga 


tattctgggg 


cccccggatc 


gacatgagca 


acccgttcca 




tcccggtcgc 


cccctgcacg 


gaaagtttgt 


gaccggtgac 


caaggcttgg 


gccattgcat 


7200 


lOcgttcgccaa 


accgacgtcg 


cagaagctca 


taagttttat 


agcctgctgg 


gcttccgtgg 


*7 T C f\ 

7260 


ggacgtcgaa 


taccggattc 


cgttgcccaa 


cggcatgact 


gccgaactgt 


cgttcatgca 


7320 


ttgcaacgcc 


cgtgatcact 


ccattgcttt 


tggtgccatg 


cccgctgcca 


aacgactcaa 


73 80 


tcacttgatg 


cttgagtaca 


cccatatgga 


agacttggga 


tacacgcacc 


aacagtttgt 


7440 


aaagaacgaa 


attgacattg 


ccttgcagct 


tggcattcac 


gccaacgaca 


aggcgttgac 


7500 


15gttctatggt 


gcaacgcctt 


cgggctggct 


cattgagccc 


ggctggcgag 


gtgccacggc 


7560 


catagatgaa 


gcggagtatt 


acgtcggcga 


catcttcggc 


catggcgtgg 


aggccactgg 


7620 


atatggcctg 


gatgtaaaac 


tgagctaaag 


atgcgcgctc 


gttgggcgag 


gctctagtcc 


7680 


agcatcttca 


tacgcaacca 


accttgcagg 


gcgatgagat 


caaaggacgt 


taaagcgaag 


7740 


gggaagtggt 


tcgggccatg 


cgcataccga 


tccatgacat 


ttgtttcata 


gtatataggt 


7800 


20agataggtga 


atcaagcgct 


tagtcaacta 


gtggacacat 


ctgttccatg 


aggctatcta 


7860 


ctatctattc 


aaaacaagaa 


taataaatag 


gatgaaaata 


ataatgataa 


aaagaacgat 


7920 


ttgtcttgtg 


tatcctctat 


tctgtttggc 


aagccccaca 


tgggccgaag 


agtcgccttg 


7980 


gacgtaccgt 


attggtatga 


ctaatgtagc 


tttcgatgct 


agcgcaaaag 


tatacttaaa 


8040 


tggtcagcgg 


gtgccaggag 


gaagcgctga 


tgcgagcgat 


aacaacgcgc 


ttacattcga 


8100 


25cttcggctac 


gccatcaacg 


accagtggaa 


tgtacgtgcg 


attgtcggta 


ttccgcctac 


8160 


aactaaagtg 


acgggcgcag 


gcacacttcc 


tggtatccag 


ctggggaaaa 


taacttacgc 


8220 


tccaacagta 


ttaacgttga 


actataacct 


ccccgctttg 


ggtcccgttc 


gccctcacat 


8280 


aggtgcggga 


gtcaattaca 


cgcggatttt 


Cgaaagtcgg 


gacgctaatc 


taaaatcgtt 


8340 


cgatgccgac 


cacgcttggt 


cccccgcgct 


acatgttggt 


gccgatattg 


acgttaaccg 


6400 


30tggttggttc 


gttagcattg 


atatccggaa 


gttatacctg 


aaaagcgacg 


catcagggta 


8460 


cttggggcca 


caggaggcta 


aagcacgggt 


aactcttgac 


ccattactaa 


cttcgatcgc 


8520 


gatcggacgc 


caattctgat 


gattctgttt 


aaagttcttt 


atctatctaa 


ccgcaaaggg 


8580 


tgtttccatg 


tcgaataaaa 


ttatgaaaac 


gtcgcgtctt 


accgccgaag 


atatcaacgg 


8640 


cgcctggact 


ataatgccca 


caccctcgac 


gcctgatgct 


tctgattggc 


gcagcactgc 


8700 


35caccgtggac 


ttagaagaga 


ctgcccgcat 


agttgaagag 


ctgattgcag 


ctggtgtcaa 


8760 


cggtattcta 


agtatgggta 


cttttggtga 


gtgcgccacg 


ttgacctggg 


atgaaaaacg 


8820 


tgattatgtc 


tcgacgattg 


tcgagaccat 


tcgtggtcgc 


gtgccttatt 


tctgtggcac 


8880 


gacagcctta 


aatacccgag 


aagtcatccg 


ccagacccga 


gagcttatcg 


atattggcgc 


8940 


caacggcact 


atgctcgggg 


tgccgatgtg 


ggtgaagatg 


gacctgccta 


cagcggttca 


9000 


40gttctatcgt 


gatgttgcag 


atgcggtacc 


agaggctgcc 


attgcgattt 


acgccaaccc 


9060 
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cgaagcattc 


aagttcgact 


tccctcgccc 


attctgggca 


gagatgtcca 


aaattccgca 


9120 


ggtagtgact 


gcgaagt:atc 


taggcatcgg 


aatgcttgac 


ttggacctga 


gactggcacc 


9180 


caacatccgc 


ttccttcccc 


acgaagatga 


ctattacgcg 


gccgcacgca 


tcaatcccga 


9240 


gcgcataacc 


gcgttctggt 


caagcggggc 


catgtgcggc 


ccggctaccg 


ccatcatgtt 


9300 


5gcgtgacgaa 


gtggtgcggg 


ccaagagcac 


cggtgactgg gccaaggcca 


aagccatctc 


9360 


cgatgatatg 


cgtgcagccg 


actcgacatt 


gtttccgcgt 


ggcgactttt 


cggagttctc 


9420 


gaagtataat 


atcgggcttg 


aaaaggcacg 


gatggatgcg gctggttggc 


tcaaggctgg 


9480 


gccctgccgt 


ccgccctaca 


accttgttcc 


agaagactac 


ctcgctggtg 


cacagaaatc 


9540 


aggcaaggct 


tgggccgcgc 


tgcacgctaa 


atacagtaat 


gaattgaagt 


agttcacctc 


9600 


lOcgcagacctg 


agtgacaggg 


tggcgcagac 


gctgagggtg 


caggaattaa 


gtgagctaaa 


9660 


gcacatttct 


tgcgccaggc 


attgccagat 


cagcaaagtt 


tgctga 




9706 



15 <210> 6 

<211> 2294 
<212> DNA 

<213> Artificial Sequence 

20 <220> 

<223> A modified DNA sequence 

<221> unsure 
<222> (186) . . . (186) 
25 <223> n = a or t or g or c 

<400> 6 

agggcagckt ctgtatttgc ggcagcgaaa tgctccctaa attcctcatt taccccatct 60 

gaggattgct ttatgacagt aaagtggatt gaagcagtcg ctctttctga catccttgaa 12 0 

30 ggtgacgtcc tcggcgtgac tgtcgagggc aaggagctgg cgctgtatga agttgaaggc 180 

gaaatntacg ctaccgacaa cctgtgcacg catggttccg cccgcatgag tgatggttat 240 

ctcgagggta gagaaatcga atgccccttg catcaaggtc ggtttgacgt ttgcacaggc 300 

aaagccctgt gcgcacccgt gacacagaac atcaaaacat atccagtcaa gattgagaac 360 

ctgcgcgtaa tgattgattt gagctaagaa ttttaacagg aggcaccccg ggccctagag 420 

35 cgtaatcacc cccattccat cttttttagg tgaaaacatg aattacaata ataaaatctt 480 

ggtaagtgaa tttggtctga gccaaaagca cctgattcat ggcgatgaag aacttttcca 540 

acatgaactg aaaaccattt ttgcgcggaa ctggcttttt ctcactcatg atagcctgat 600 

tcctgccccc ggcgactatg ttaccgcaaa aatggggatt gacgaggtca tcgtctcccg 660 

gcagaacgas ggttcgattc gtgcttttct gaacgtttgc cggcatcgtg gcaagacgct 720 

40 ggtgagcgtg gaagccggca atgccaaagg ttttgtttgc agctatcacg gctggggctt 780 



15 





cggctccaac 


ggtgaactgc 


agagcgttcc 


atttgaaaaa gatctgtacg gcgagtcgct 


840 




caataaaaaa 


tgtctrggggt 


tgaaagaagt 


cgctcgcgtg 


gagagcttcc 


atggcttcat 


900 




ctacggttgc 


ttcgaccagg 


aggcccctcc 


tcttatggac 


tatctgggtg 


acgctgcttg 


960 




gtacctggaa 


cctatgttca 


agcattccgg 


cggtttagaa 


ctggtcggtc 


ctccaggcaa 


1020 


5 


ggttgtgatc 


aaggccaact 


ggaaggcacc 


cgcggaaaac 


tttgtgggag 


atgcatacca 


1080 




cgtgggttgg 


acgcacgcgt 


cttcgcttcg 


ctcgggggag 


tctatcttct 


cgtcgctcgc 


1 14 0 




tggcaatgcg 


gcgctaccac 


ctgaaggcgc 


aggcttgcaa 


atgacctcca 


aatacggcag 


1200 




cggcatgggt 


gtgttgtggg 


acggatattc 


aggtgtgcat 


agcgcagact 


tggttccgga 


1260 




attgatggca 


ttcggaggcg 


caaagcagga 


aaggctgaac 


aaagaaattg gcgatgttcg 


1320 


10 


cgctcggatt 


tatcgcagcc 


acctcaactg 


caccgttttc 


ccgaacaaca 


gcatgctgac 


1380 




ctgctccggt 


gttttcaaag 


tatggaaccc 


gatcgacgca 


aacaccaccg 


aggtctggac 


1440 




ctacgccatt gtcgaaaaag 


acatgcctga ggatctcaag cgccgcttgg ccgactctgt 


1500 




tcagcgaacg gtcgggcctg 


ctggcttctg 


ggaaagcgac 


gacaatgaca 


atatggaaac 


1560 




agcttcgcaa 


aacggcaaga 


aatatcaatc 


aagagatagt 


gatctgcttt 


caaaccttgg 


1620 


15 


tttcggtgag gacgtatacg 


gcgacgcggt 


ctatccaggc 


gtcgtcggca 


aatcggcgat 


1680 




cggcgagacc 


agttatcgtg 


gtttctaccg 


ggcttaccga 


gcacacgtca 


gcagctccaa 


1740 




ctgggctgag 


ttcgagcatg 


cctctagtac 


ttggcatact 


gaacttacga 


agactactga 


1800 




tcgctaacag 


acgagtcgac 


catgatgatc 


aatattcaag 


aagacaagct 


ggtttccgcc 


1860 




cacgacgccg 


aagagattct 


tcgtttcttc 


aattgccacg 


actctgcttt 


gcaacaagaa 


1920 


20 gccactacgc 


tgctgaccca 


ggaagcgcat 


ttgttggaca 


ttcaggctta 


ccgtgcttgg 


1980 




ttagagcact 


gcgtggggtc 


agaggtgcaa 


tatcaggtca 


tttcacgcga 


actgcgcgca 


2040 




gcttcagagc 


gtcgttataa 


gctcaatgaa 


gccatgaacg 


tttacaacga 


aaattttcag 


2100 




caactgaaag 


ttcgagttga 


gcatcaactg gatccgcaaa actggggcaa cagcccgaag 


2160 




ctgcgcttta 


ctcgctttat 


caccaacgtc 


caggccgcaa 


tggacgtaaa 


tgacaaagag 


2220 


25 


ctacttcaca 


tccgctccaa 


cgtcattctg 


caccgggcac 


gacgtggcaa 


tcaggtcgat 


2280 




gtcttctacg 


ccgc 










2294 



<210> 7 
<211> 4355 
30 <212> DNA 

<213> Artificial Sequence 



<220> 

<223> A modified DNA sequence 

35 

<400> 7 

atcgataaaa ccatgagggt aataaataat gatgaaggag actatctaga attcgttctg 
gagacggtgc cggtggcata accgtagacc agcaggctga gcaaaatacc tcggggtgat 
aagccgcgct accgcgtccg ccatatgcct gagtgagtgc gctaaggtcg ggccgctcaa 
40 tggcttcgag aataaagcgg gccaggtgat cttctgtcaa tctcgccgaa tgggtggaag 



16 





cacatcaacc 


tgaccggcga 


ttacgtctgg 


cggcagagcc 


gcagactgga 


ggtcgggaag 


300 




ttccggccgt 


tacggtzggcc 


ctgaaaacct 


tagtgtacga 


ttttttccgt 


cttctctatt 


3 60 




cggcctgcgt 


tgcgacgttc 


gatctgacta 


gggcgcagcg 


gcaacatcgg 


ctgagtacga 


A O A 

4zU 




tccagcgcct 


ggatctgggt 


tatttcgtca 


accgacaaca 


ccagagcgtt 


atccggtggg 




5 


ttcaggtaaa gcccaaccac 


attgactacc 


tctcggcaaa 


atgagggtcg 


ttactaattt 


C A A 




tgaacgtctt 


caagcgatgg 


ggttttaggt 


cggaagctgc 


ccagacctgc 


ccaacctacc 


buu 




acgtcgtgac 


ccgagcatat 


tcggcccgcg 


gcggcaaaaa 


acacagaaat 


gagcgggtga 


D OU 




cccgatcgcc 


tttgatcgat 


tctccgcttt 


caaaatgggc gggggctgaa gtcagccaga 


f 




aatacatgac 


tacttcagac 


gattcgcagt 


attcacgctg gtgataaaca 


aattcaacta 


T O A 
/ O U 


10 


tgctttattg 


acaaataaaa 


gcacgctcac 


catcatcgcg 


aatacaaatc 


ttataaaaat 


840 




taagccggat 


ttatggaact 


tctcatacag 


ccaaacaatc 


gcctcattag 


ctttagtccc 


900 




ggcgccaacc 


ttctggaagt 


gcttcgcgaa 


aacggtgtcg 


ctatttccta 


cagttgtatg 


960 




tctgggcgtt 


gcggaacctg 


ccgctgtcgg 


gttacagatg gcagtgtaat 


tgattcgggg 


1020 




gcgggaagcg 


ggttaccaaa 


cctcgtggac 


gagcattatg 


tgctcgcctg 


tcagtcagta 


1080 


15 


cttactcaca 


attgcgcgat 


cgaaatccca 


gaaaccgacg aaatcgtcac 


ccacccggcg 


114 0, 




agaatcatca t agggcactgt 


ggtcgccgtc 


gagtcgccca 


ctcacgatat 


ccgtcgccta 


XZKJV 




cgcgtacgcc 


tcgctaagcc 


cttcgagttc 


tcacccggac 


agtacgcgac 


attgcagttc 


1 *> C A 

lzoU 




agtcctgagc 


atgcgcgtcc 


gtattcaatg 


gcaggtctgc 


cagatgacca 


agaaatggag 


n i o a 




ttccacatac 


gcaaggtgcc 


gggtgggcgc 


gtaacggagt 


atgttttcga 


gcacgtccgc 


13 80 


20 gaaggtacaa gcatcaagtt 


gagcgggcca 


cttggtacgg 


cttatttgcg 


tcagaaccac 


1440 




accgggccga 


tgctctgtgt 


gggcggtggg 


accggactag caccggtgct gtcgattgtt 


1500 




cgcggcgcgc 


tgaagttggg 


tatgacaaac 


cccatcctcc 


tttatttcgg 


agtgcgcagt 


1560 




cagcaagacc 


tctacgacgc 


agagcgattg 


cacaaactcg 


ccgctgatca 


ccctcaactg 


1620 




accgtacaca 


cggtaatcgc 


aatgggcccg 


attaatgaga 


gtcagcgagc 


cggtctagtt 


1680 


25 


accgatgtga 


tcgaaaaaga 


catcatttcg 


ctggctgggt ggagggccta 


cctgtgcggc 


1740 




gcaccagcga 


tggttgaagc 


gctttgcacc 


gttaccaagc 


atcttggaat 


atcacccgaa 


"1 O A A 

1800 




catatttatg 


ccgatgcctt 


ctatcccggt 


ggaatctgaa 


tagtcccttt 


ccctgcacct 


T O C A 




ctgtccatcg 


aggactcatc 


aggaggatac 


tcaaataagc 


gcaaataata 


acagtcgcgt 






cagtatttgc 


ggcagcgaaa 


tggtttccct 


ctccctcatt 


taccccatct 


gaggatagtt 


1 Q Q A 


30 


ttatgacaga 


aaaatggatt 


gaagcagtcg 


ccctttctga 


catcccagaa 


ggtgatgtcc 


2 040 




tcggcgtgac 


tgtcgagggc 


aaggagctgg 


cgttgtacga 


agtggaaggc 


gaaatctacg 


*} 1 A A 




ctaccgacaa 


cctgtgcacg 


catggtgccg 


cccgcatgag 


cgatggttat 


ctcgaggggc 






gagaaatcga 


atgccccttg 


catcaaggtc 


ggtttgacgt 


ttgtacaggc 


agagccctct 






gcgcccccgt 


gacagagaac 


atcaaaacat 


atgcagtcaa gattgagaac 


ctgcgcgtaa 


2280 


35 


tgattgattt 


gagcggagag 


ttttaatagg 


aggtaccccg gaccctagcg 


cgtaactacc 


2340 




ccaattccat 


cttttttagg 


tgaaaacatg 


aattacaaaa 


acaaaatatt 


ggtgagtgaa 


2400 




tctgggctga 


cccaaaagca 


cctgattcat 


ggcgatgaag aacttttcca gcacgaactg 


2460 




agaaccattt 


ttgcgcggaa 


ctggcttttt 


ctcactcatg 


acagcctgat 


tccatccccc 


2520 




ggcgactatg 


ttaccgcaaa 


aatgggtatt 


gacgaggtca 


tcgtctctcg gcagagcgac 


2580 


40 ggttcgattc gtgccttcct 


gaacgtttgt 


cggcaccgtg 


gcaagacgct 


ggttaacgcg 


2640 



17 



gaagccggca 


atgccaaagg 


tttcgtttgc 


agctatcacg 


gctggggctt 


cggctccaac 


2700 


ggtgaactgc 


agagcgttcc 


attcgaaaaa 


gagctgtacg 


gcgagtcgct 


caacaaaaaa 


2760 


tgtctggggt 


tgaaagaagt 


cgctcgcgtg 


gagagcttcc 


atggcttcat 


ctatggttgc 


2820 


ttcgatcagg 


aggcccctcc 


tcttatggac 


tatctgggtg 


acgctgcttg 


gtacctagag 


2880 


cccatcttca 


aacattcagg 


cggtttagaa 


ctggtcggtc 


ctccaggcaa 


ggttgtgatc 


2940 


aaggccaact 


ggaaggcacc 


cgcggaaaac 


tttgtggggg 


atgcatacca 


cgtgggttgg 


3000 


acgcacgcgt 


cttcgcttcg 


ctcgggagag 


tctatcttcg 


cgtcgctcgc 


tggcaacgca 


3060 


gtgctgcccc 


ctgaaggtgc 


aggcttgcaa 


atgacctcca 


aatacggcag 


cggcatgggt 


3120 


gtgttgtggg 


acggatattc 


aggcgtgcat 


agcgcagact 


tggttccgga gttgatggca 


3180 


ttcggcggct 


ctaagcagga 


aaggctgaac 


aaagaaattg 


gcgatgttcg 


cgcccggatt 


3240 


tatcgcagcc 


acctcaactg 


caccgttttc 


ccgaacaaca 


gcatgctgac 


ctgctcgggt 


3300 


gttttcaaag 


tatggaaccc 


gatcgacgca 


aacaccaccg 


aggtctggac 


ctacgccatt 


3360 


gtcgaaaaag 


acatgcccga 


ggatctcaag 


cgccgcttgg 


ccgactcggt 


tcagcgtacg 


3420 


gttgggcctg 


ctggcttctg 


ggaaagcgac 


gacaatgaca 


atatggaaac 


agcgtcgcaa 


3480 


aacggcaaga 


aatatcaatc 


cagagatagt 


gatctgcttt 


caaaccttgg 


tttcggtaag 


3540 


gacgtatacg 


gcgacgcggt 


ctatcctggc 


gtcgtcggca 


aatcggcgat 


cggcgagacc 


3600 


agttatcgtg 


gtttctaccg 


ggcttaccag 


gcacacgtca 


gcagctccaa 


ctgggctgag 


3660 


ttcgaggatg 


cctctagtac 


ttggcatacc 


gaactgacga 


agactactga 


tcgctaacag 


3720 


acgagtcgac 


catgatgatc 


aatattcaag 


aagacaagct 


ggtctccgcc 


cacgacgccc 


3780 


aagagtttct 


tcgtttcttc 


aattgccacg 


acgcggcttt 


gcaacaagaa 


gccaccacgc 


3840 


tgctgaaccg 


ggaagcgcat 


ctgttggaca 


ttcaggctta 


ccgggcttgg 


ttagagcact 


3900 


gcgtggggtc 


agaggttcaa 


tatcaggtca 


tttcacgcga 


actgcgcgcc 


gcttccgagc 


3960 


gccgttataa 


gctcaatgaa 


gccatgaacg 


tttacaacga 


aaattttcag 


caactgaaag 


4020 


ttcgaatcga 


gcatcaactg 


gatccgcaaa 


actggagcaa 


cagcccgaag 


ctgcgcttta 


4080 


ctcgcttcat 


caccaatgtc 


caggccgcaa 


gggacgtaga 


tgacgaagag ctacttcaca 


4140 


tccgctccaa 


cgtcattctg 


caccgggcac 


gacgtggcaa 


tcaggtcgat 


gtcttctacg 


4200 


ccgcccggga 


agacaaatgg 


aaacgtggcg 


aaggtggagt 


gcgaaaattg gtccagcgat 


4260 


tcgtggatta 


cccagagcgc 


atacttcaga 


cgcacaatct 


gatggtcttt 


ctgtgatcca 


4320 


gtgaccactt 


ttacaaatgg 


tgactgctac 


cgcgg 






4355 



30 
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aaatctacgc 


taccgacgac 


ctgtgcacgc 




tcgaggggcg 


agaaa'tcgaa 


tgccccttgc 




gagccctctg 


cgcccccgtg 


acagagaaca 




tgcgcgtaat 


gattgattta 


agctgagaat 


5 


cgtaattatc 


cccattccat 


ctttttttag 




tggtaagtga 


gtctgggctg 


acccaaaagc 




agcacgaact 


gagaaccatt 


tktgcgcgga 




ttccttcccc 


cggcgactat 


gttaccgcaa 




ggcaaagcga 


cggttcgatt 


cgtgccttcc 


10 


tggtgaacgc 


ggaagccggc 


aatgctaaag 




tcggctccaa 


cggcgaactg 


cagagcgttc 




tcaacaaaaa 


atgtctgggg 


ttgaaagaag 




tctatggttg 


cttcgatcag 


gaggcccctt 




ggtacctgga 


acccatcttc 


aaacattcag 


15 


aggttgtgat 


caaggccaac 


tggaaggcac 




acgtcggttg 


gacgcacgcg 


tcttcgcttc 




ctggcaacgc 


agtgctgccg 


cctgaaggtg 




gcggtatggg 


tgtgttgtgg 


gacggatatt 




aattgatggc 


attcggcggc 


gctaagcagg 


20 


ccgcccggat 


ttatcgcagc 


cacctcaact 




cctgctcggg 


tgttttcaaa 


gtatggaacc 




cctacgccat 


tgtcgaaaaa 


gacatgcccg 




ttcagcgaac 


ggtcgggcct 


gctggcttct 




cagcatcgca 


aaacggcaaa 


aaatatcaat 


25 


gtttcgggaa 


ggatgtatac 


ggcgacgcgg 




tcggcgagac 


cagttatcgt 


ggtttctacc 




actgggctga 


gttcgaggat 


gcctctagta 




atcgctaaca 


gacgagtcga 


ccatgatgat 




ccatgacgcc 


gaagagtttc 


ttcgtttctt 


30 


agccaccacg 


ctgctgaccc 


gggaagcgca 




gttagagcac 


tgcgtggggt 


cagaggttca 




cgcttccgag 


cgccgttata 


agctcaatga 




gcaactgaaa 


gttcgagtcg 


agcatcaact 




gctgcgcttt 


actcgcttca 


tcaccaatgt 


35 


tctgcttcac 


gtccgctcca 


acgtcgttct 




tgtcttctac 


gccgcc 





atggtgccgc 


ccgcatgagc 


gatggttatc 


120 


atcaaggtcg 


gtttgacgtt 


tgtacaggca 


180 


tcaaaacata 


tgcagtcaag 


attgagaacc 


240 


ttttaatagg 


aggcgccccg 


gaccatagag 


300 


gtgaaaacat 


gaattacaaa 


aacaaaatct 


360 


acctgattca 


tggcgatgaa 


gaacttttcc 


42 0 


actggctttt 


tctcactcat 


gacagcctga 


480 


aaatggggat 


tgacgaggtc 


atcgtctctc 




tgaacgtttg 


ccggcaccgc 


ggcaagacac 


^ n n 
bUU 


gtttcgtttg 


cagttatcac 


ggctggggct 


obU 


cattcgaaaa 


agagctgtac 


ggcgagtcgc 


720 


tcgctcgcgt 


agagagcttc 


catgggttca 


780 


ctcttatgga 


ctatctcggt 


gacgctgctt 


840 


gcggtttaga 


actggtaggt 


cctccaggta 


900 


ctgcggaaaa 


ctttgtgggt 


gatgcatacc 


960 


gcacaggcga 


atctatcttc 


tcgtcgctcg 


1020 


cgggcttgca 


aatgacctcc 


aaatacggca 


1080 


caggcgtgca 


tagcgcagac 


ttggttccgg 


1140 


aaaggctgaa 


caaagaaatt 


ggcgatgttc 


1200 


gcaccgtttt 


cccgaacaac 


agcgtgctga 


1260 


cgatcgacgc 


aaacaccacc 


9aggtctgga 


1320 


aggatctcaa 


gcgccgcttg 


gccgacgcgg 


1380 


gggaaagcga 


cgacaatgac 


aatatggaaa 


1440 


ccagagatag 


tgatctgatt 


tccaaccttg 


1500 


tctatcctgg 


cgtcgtcgga 


aaatcggcga 


1560 


gggcttacca 


ggcacacgtc 


agcagctcca 


1620 


cttggcatac 


cgaactgacg 


aagactactg 


1680 


caatactcaa 


gaagacaagc 


tggtatccgc 


1740 


caattgccac 


gactcggctt 


tgcaacaaga 


1800 


tctgctggac 


attcaggctt 


accggacttg 


1860 


atatcaagtc 


atttcacgcg 


aactgcgcgc 


1920 


agccatgaac 


gtttacaacg 


aaaatttcca 


1980 


ggattcacaa 


aactggagca 


acagcccgaa 


2040 


ccaggccgca 


atggacgtaa 


atgatgaaga 


2100 


gcaccgggca 


cgacgtggca 


atcaagtcga 


2160 
2176 



40 



<210> 9 
<211> 14462 
<212> DNA 
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<213> Artificial Sequence 
<220> 

<223> A modified DNA sequence 

5 



<400> 9 



gtcgactccg 


gtggccaaat 


cgacgggagc 


acaagcgtta 


cagcatcggg 


gctatcggcc 


60 


tcaaccgctg 


agacgcatct 


acatacccaa 


aagcaatggc 


aagaagcgtc 


cgctgggaat 


120 


tccaacgatg 


cgggatcgtg 


cgatgcaggc 


gctgtggaaa 


ctggcacttg 


agccggttgc 


180 


lOcgaaacgcgt 


gtagatccca 


actcctatgg 


atttcggccg 


cagcgatcca 


ctgccgatgc 


240 


aatcgcacat 


tgcttcaatg 


cactggcgaa 


acgtggttcg 


gcgcactggg 


tactggaagg 


300 


tgacattcga 


ggctgtttcg 


acaatatcag 


tcacgattgg 


ctgctcacca 


acgtgcccat 


360 


ggataaggtg 


gttctgcgca 


aatggcttcg 


agcggggtat 


gtcgatcagg 


gagccttgtt 


420 


cgcaacggag 


gcaggaaccc 


cgcaaggggg 


aatcatctct 


ccggtacttg 


cgaattggac 


480 


15gctggatggc 


ctggaagatg 


tcgtccatgc 


aagcgtggct 


tcgacagcgc 


gcaagcgtaa 


540 


gccattcaag 


atacacgtcg 


tacgatatgc 


cgatgacttc 


atcatcacag 


gggccacgca 


600 


agctgttctg 


caacatcagg 


ttcgtcctgc 


aattgaggcg 


ttcctgaaac 


agcgtggact 


660 


ggaactctct 


gatgaagaga 


ctcagataac 


gcatatctca 


caaggcttcg 


attttctggg 


720 


ccagaacgta 


cgcaagtacg 


ccggcaagct 


actcatcact 


ccggcttgca 


agagtgtgaa 


780 


20ggcactgctg 


gataaggtgc 


gggaaatcgc 


gaacgcaaac 


aaggcggcga 


ctcaagctaa 


840 


cttgatcttg 


accctgaacc 


cggtaatccg 


aggatgggcc 


atgtatcatc 


gccatgttgt 


900 


cgccgccaac 


gtttcgcatg 


gatcgaccat 


cagatttggc 


aagtgttatg 


gcgctgggca 


960 


gttcgtcggc 


acgccatgaa 


aaatgcctat 


tgggtaaaac 


aacgctactt 


tcgtgtcgta 


1020 


ggtcaacggc 


actgggtttt 


cgccactcag 


gagaaagcgc 


gcggcttgag 


tcaacccgct 


1080 


25tggctgtatg 


ccgccgccag 


tgtttcgatc 


gtgcggcata 


tcaaaatatg 


cagtgcagcg 


1140 


aacccgttcg 


atccggcatg 


gacgttttac 


cttgagcgtc 


gaagagcgca 


tcgtcaggtt 


1200 


gcccagtctt 


actcaggctg 


ctggaaggct 


tgagccgtat 


gaggggtgac 


tcccaagtac 


1260 


ggttcttagg 


ggagacaggg 


gtggtaacac 


ccctgtttta 


cccgactgaa 


agcgcatatc 


1320 


ggagccgacg 


cagaatcggg 


cctggtacat 


catgtccatg 


cacagccgcc 


aacgtggctg 


1380 


30acgtgacgga 


agttgcccat 


ctgttgcacg 


gtggcgagaa 


cgttgtctgt 


gcttggggtt 


1440 


acaccggcgt 


agagaaacgg 


cctgagcacg 


acggtcggca 


agtgatctgg 


cagatctgcg 


1500 


cgacgcagca 


cctacaagca 


tttgagcaaa 


cgcagtgcgc 


cgtacaaggc 


caggcgcaag 


1560 


atcgagaagg 


ccaaagccca 


ggtacgtgcg 


aaggtcgaac 


atccgtttcg 


ggtgatcaag 


1620 


cgtcagttcg 


gttatgtgaa 


gacccgcttc 


cgtgtcctgg 


ccaagaatac 


ggcggaattg 


1680 


35accacactgt 


tcgcgctgtc 


gaactctgtg 


gatggtgcgc 


cggcaattat 


tgcctgctgc 


1740 


gggagaggtg 


cgcccgtgag 


tgacagaaaa 


ccagggcttt 


gcctcggtta 


tccaataaca 


1800 


cagcgactga 


aaatcgggca 


tttcggcatc 


cctaagccgc 


ccatttccga 


ctgatgagca 


1860 


acttgttcgg 


agtttcccta 


aagcacaagc 


gcaagccatc 


agaaaagagt 


gcctgagtgt 


1920 


gctccaagat 


agcccgaaac 


taaggccgga 


catggaaaag 


accaagatca 


cccatgtgaa 


1980 


40tgatggactt 


gtttttgtgg 


gccatcggat 


cattcgcaag 


cgtagccgat 


acggagatat 


2040 



20 



gcgtgtgatc 


acgacgatcc 


gagacaaagc 


cagaagattc 


gcagcctcac 


tgacggcgct 


2100 


gctatcaggt 


aaccaCagtg 


aaagtaagat 


cgacatggtt 


gaatcagtca 


accgaaagct 


2160 


taaaggctgg 


gcggcgttta 


tcaattcgtt 


gattcaaagc 


caaagtattc 


agctatatcg 


2220 


atcgtgtcgt 


gttttggaaa 


ctggcccatt 


ggctggctca 


aaaataccgc 


tcccaggttt 


2280 


5gtcttgcctg 


ttcctggccc 


gccgatgaac 


accacattct 


gcgcggtatc 


tgtgaactcc 


2340 


aggttggacc 


gctccctatg 


ggcatcaaca 


atatctcgtg 


gccttggctg 


ggatacgcaa 


2400 


ctaaccaaaa 


tgcgcgactg 


gcgcatgcag 


cttccatatg 


ggagacgctg 


gaagcgtcgg 


2460 


aatagaccgt 


caatcagaag 


accttcttcg 


ctcccgtcga 


actgaaagcc 


tcaccgagtg 


2520 


cgaacgtcaa 


cgggaaaggc 


gcgcaaagac 


ttgccttgct 


cacctcgtca 


cgagcgtaca 


2580 


lOaacgctgtga 


gctggctgta 


cgggtgcgct 


gcttagcgtt 


ggtttttccg 


ttctgtgagc 


2640 


caaccccgtg 


atatcgtagt 


gcaggaccat 


gccgttaagg 


agggctgcct 


ggtcagcatt 


2700 


ttcagccagc 


gccggaagtt 


cctgcacttc 


ggccttgaac 


tggcgttcca 


atgcctggat 


2760 


atggccatag 


ggaggtacgc 


tttggcgtaa 


ggcgttgcat 


gcagcttggt 


gctgtcgagg 


2820 


ctgatggtgc 


ggagcttgag 


cagcttcatt 


tgccctgcca 


actccaccgc 


ctgaaggaag 


2880 


ISttcccggcca 


agtcatcgag 


gaagcgctgc 


cggaagctgg 


ccggggtgac 


tcttgcggcg 


2940 


aggtggcgaa 


aagcgagcga 


gtcgtaactg 


gcgcgctcaa 


tcctggagct 


gggaactttg 


3000 


ccggtggcat 


aaccatagac 


catcgggctg 


agcacaatgc 


ctcgggggtg 


ataagccgct 


3060 


accgcgtccg 


ccatatgcct 


gagtgagggc 


gctaaggtcg 


ggccgctcaa 


tggcttcgag 


3120 


aataaagcgg 


gccaggtgat 


cttctgccat 


ctctcccgaa 


tgggtggaag 


cacatcaacc 


3180 


20tgaccagcga 


ttacgtctgc 


gcagagccgc 


agactagagg 


tcgggaagtt 


ccggccgtta 


3240 


cggctgcccg 


gaaaacctca 


gtgtacgatt 


ttttcagtct 


cctctattcg 


acctgcgttg 


3300 


cgacgttcga 


tctgactagg 


gcgcagcggc 


aacatcggct 


gagtacgatc 


cagcgcctgg 


3360 


atttgggtta 


tttcgttaac 


cgacaacacc 


agagcgttat 


cctgcgggtt 


caggcagagc 


3420 


ccaaccacat 


gactacctct 


cggcaaaatg 


agggtcgtta 


ctaattttga 


acgtcttcaa 


3480 


25gcgatggggt 


tttaggtcga 


aagctgccca 


ggcctgcgga 


acctaccaca 


tcgtaacccg 


3540 


agcatattcg 


gcccgcggcg 


gctaaaaaca 


cagaaatgag 


cggggtgacc 


cgatcgcctt 


3600 


tgatcgattc 


tccgctttca 


aaagcggcag 


gggctgaagt 


cagccggaaa 


tacccagacc 


3660 


aatcacaaca 


ttcatgctgg 


tgataaataa 


attcaactat 


gctttattga 


caaataaaag 


3720 


cacactcacc 


atcatcgcga 


atacaaatct 


tataaaatta 


agccggattt 


atgaaacttc 


3780 


30tcatacagcc 


aaacaatcgc 


ctcattagct 


ttagtcccgg 


cgccaacctt 


ctggaagtgc 


3840 


ttcgcgagaa 


cggtgtcgct 


atttcctaca 


gttgtatgtc 


tgggcgttgc 


ggaacctgcc 


3900 


ggtgccgggt 


tacagatggt 


agtgtaattg 


attcggggac 


gggaagcggg 


ttaccacacc 


3960 


tcgtggacga 


gcattatgtg 


ctcgcctgtc 


ggtcagtact 


tactaacaat 


tgcgcgatcg 


4020 


aaatcccaga 


agccgacgaa 


atcgtcaccc 


acccggcgag 


aatcatcaag 


ggcactgtgg 


4080 


35tcgctattga 


gtcgcccact 


cacgatatcc 


gtcgcctacg 


cgtacgcctc 


gccaagccct 


4140 


tcgagttctc 


acccggacag 


tacgcgacat 


tgcagttcag 


tcctgagcat 


gcgcgtccgt 


4200 


attcaatggc 


aggtctgcca 


gatgaccaag 


aaatggagtt 


ccacatacgc 


aaggtgccgg 


4260 


gtgggcgcgt 


aactgagtat 


gttttcgagc 


acgtccgcga 


aggtacaagc 


atcaaattga 


4320 


gcgggccact 


tggtacggct 


tatttacgtc 


agaaccacac 


cgggccgatg 


ctctgtgtgg 


4380 


40gcggtggaac 


cggactagca 


ccggtgctgt 


cgattattcg 


cggcgcgctg 


aagttgggta 


4440 



21 



tgacaaaccc 


catcctcctt 


tatttcggag 


tgcgcagtca 


gcaagacctc 


tacgacgcag 


4500 


agcgattgca 


taacctrcgcg 


gctgatcacc 


ctcaactgac 


cgtacacacg 


gtaatcgcaa 


4560 


tgggcccgat 


taatgagagt 


cagcgagccg 


gtctagttac 


cgatgcgatc 


gaaaaagaca 


4620 


tcagttcgct 


ggctgggtgg 


agggcctatc 


tgtgcggtgc 


accagcgatg 


gttgaagcgc 


A C O A 

46 oU 


Stttgcaccgt 


taccaaacat 


cttggaatat 


caccggaaca 


tatttatgcc 


gatgcc t tct 


A T A t\ 


atcccggtga 


aatctgaatc 


gtccctttcc 


ctcacctcgg 


tccattgagg 


actcatcagg 


a q n n 


aggatactca 


aataggcgca 


aataataaca 


gccgcgtcac 


tatgtgcggc 


agcgaaatgg 


A O C A 

4 o oU 


tttccctctc 


cctcatttgc 


cccatcggag 


gatagtttta 


tgacagaaaa 


atggattgaa 


4920 


gcagtcgccc 


tttgtgacat 


tccagaaggt 


gatgtcctcg 


gcgtgactgt 


cgagggtaag 


4 980 


lOgatctggcac 


tgtacgaagt 


ggaaggcgaa 


atctacgcta 


ccgacgacct 


gtgcacgcat 


5040 


ggtgccgccc 


gcatgagcga 


tggttatctc 


gaggggcgag 


aaatcgaatg 


ccccttgcat 


5100 


caaggtcggt 


ttgacgtttg 


tacaagcaga 


gccctctgcg 


cccccgtgac 


agagaacatc 


5160 


aaaacatatg 


cagtcaagat 


tgagaacctg 


cgcgtaatga 


ttgatttaag 


ctgagaattt 


5220 


ttaataggcg 


gcgccccgga 


ccatagagcg 


tgattatccc 


cattccatct 


ttttttaggt 


5280 


15gaaaacatga 


attacaaaaa 


caaaatcttg 


gtaagtgagt 


ctgggctgac 


ccaaaagcac 


5340 


ctgattcatg 


gcggtgaagg 


gcttttccag 
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tccagtgacc 


agcagacctt 


cgagcgcaag 
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14160 


gggaggtcgc 


ggcgtaccgc 


tcgccactcc 


cgacgcactc 


ttttggaagc 


ctgctgttat 


14220 


gtcgtgcgca 


cagggtgctc 


ctggcgaatg 


ctgcttcgcg 


agtttcccta 


ttgggacaat 


14280 


Sgtctataaaa 


cctttcggcg 


ctggagcgtc 


agggcaagtt 


tgagcaaatg 


cacgaccggc 


14340 


tccgtgccca 


atggcgagag 


cgggtggatc 


gtgatgaacg gccgtcagcc 


gctgttctgg 


14400 


attcccaatc 


aacgcgcacg 


tctccccaag 


gcggtgaaag 


cggttacgac 


gcaggcaaaa 


14460 


ag 












14462 
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gtcgacgcat 


gatgcaggac 


tgggccgatc 


gcctcgacct cttcgagcag 


aaccaggtcg 


60 


aggcggccag 


catgccgctc 


accgtgcatc 


tggaaggtgt gcccgcattc 


ccgagtgagc 


120 


aaaccgcaag 


cgcgccctct 


acgccggttg 


ccgcttcgcc aagcctgctc 


gtgacgaagc 


180 


cgggtgacgc 


catgccgttg 


gtttctgccg 


ccgcacatcg gctgccggca 


gtgccgcccc 


240 


25agcgatcggc 


cgcgccgctg 


gtgccttcgg 


acattcagcg cgagagaatg 


gatttgttcg 


300 


atgtcttcga 


agcgccgcac 


aaccttcccg 


tcgctgcgtt tgccaagatg 


gcgggcaaat 


360 


ctcgcaggtg 


gatcagctac 


gagatccagg 


cgggcaactt gctggcgttg 


aacgtgggca 


420 


accgcggcca 


gcgcgtgccg 


gactggcacc 


tcgacccgct caagcacgag 


ctgatccagt 


480 


ctgtcctgaa 


gctgaccagg 


ggtgcggacc 


cttggcagat ctaccatgca 


ctgctgcagc 


540 


30cgcgctcgat 


gctgcggggg 


cgctcggcac 


tggagggcgt gactgccagc 


aatctcgaca 


600 


agctcgtcat 


ggccgtgagc 


acagcggtga 


aggaaagcga atggaccccg 


ctgcgggtcg 


660 


gtggtcgtgt 


agtcaaacaa 


cgggccggta 


agggatggcc tggacattat 


tcatattagt 


720 


gatactaata 


ttcatttatg 


gtttattgac 


tattagaata cagtcaatga 


tcatggaacc 


780 


ataaagcata 


taaataaaga 


agctagatct 


atggaacttc tcgtactacc 


gaacaatcgc 


840 


35cgcttgcctt 


ttgattccgg 


tgccaacctt 


ttggaagtgc tccgtgagca 


ccgtgtgggt 


900 


atttcctaca 


gctgtatgtc 


tggacgatgc 


ggtacttgcc gctgccgagt 


tatagatggc 


960 


agcgtcatta 


gttcggcggc 


gaaaagcggt 


gactcaaatc gcatcgaaga gcattatgta 


1020 


ctcgcctgtc 


agtcagtgct 


caccagcaat 


tgcgcaattg agatcataga 


ctcagacgac 


1080 


atagtcactc 


acccggcgcg 


aatcatcaaa 


ggcatggttg tcgccgtcga gtcgcccact 


1140 


40cacgatattc 


gccgcatccg 


cattcgcctc 


gccaagccct tcgagttctc 


acccggacag 


1200 
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tacgcgatgc 


tacagttcag 


tcccgaacat 


gtgcgtccat 


attcaatggc 


tggtctgcca 


1260 


gatgaccaag 


aaatggagtt 


ccatatccgc 


aaagtgccgg 


gcgggcgtgt 


cacggagtat 


IS Z U 


attttcgagc 


acgtccgcga 


aggtacaagc 


attaagttga 


gtgggcctct 


tggtacggcc 


lioU 


tatctgcgtc 


aggctcacac 


cgggccgatg 


ctgtgtgtgg 


gcggcgggac 


cggactcgca 




Sccggtgctgt 


cgattgttcg 


c 99 c 9 c 9 ct 9 


aagtcgggaa 


tgacgaaccc 


catccacctt 


1500 


tatttcgggg 


tgcgcagtca 


gcaagacctt 


tacgacgcag 


accgattgaa 


ccaactcgcg 


15 60 


gctatccacc 


ctcaactgac 


tgtccataca 


gtgatcgcga 


cgggcccgat 


taatgagggc 


t con 


cagcgggccg 


gcctaattac 


cgatttgatt 


gaaaaagaca 


ttccctcgct 


ggctgggtgg 


1 £ Q A 

lb oU 


agagcctacc 


tgtgcggcgc 


accagcgatg 


gttgacgctc 


tatgcaccgt 


cgccaaagat 


1740 


lOcttggaatat 


cgcccgagca 


tatttatgcc 


gacgccttct 


atcccagcgg 


ggtctgaatt 


1800 


gccccgaccc 


ttcacctctg 


tacatcgaga 


attcatcagg 


aagacactta 


aatgagcatc 


1860 


actaacaaca 


gccgcgtctg 


aatatttagg 


acagctggat 


gatctctaac 


tccatcatta 


1920 


ccccatttga 


agatagcttt 


atgacagaaa 


aatggattga 


cgcagtcgct 


ctttatgaaa 


1980 


tccctgaagg 


tgacgtcctc 


ggcgtgacag 


tcgaaggtaa 


ggaactagcg 


ctgtatgaag 


2040 


IStggaaggcga 


aatctacgct 


accgacaacc 


tgtgcacaca 


tggtgctgcc 


cgcatgagtg 


2100 


atggctttct 


agaaggcaga 


gaaattgaat 


gtcctttgca 


tcaaggtaga 


tttgatgttt 


2160 


gcacaggcag 


ggccttgtgc 


gcccctgtga 


cacagaacat 


caaaacatac 


ccggtgaaga 


2220 


ttgagggcca 


gcgtgtgatg 


attgatttga 


gctgagaatt 


ttaataggag 


gcaccccgga 


2280 


ccctagagcg 


taatcccccc 


cattcgatct 


cttgaggtga 


aaatatgaat 


tacaaaaata 


2340 


20aaaacttggt 


gagtgaatct 


gggctgaccc 


aaaaacacct 


gattcatggc 


gacgaagaac 


2400 


ttttccagcg 


cgaactggaa 


accatttttg 


ctcggaactg 


gcttttcctg 


actcatgaca 


2460 


gcctgattcc 


gtcccctggc 


gactatgtta 


cggcaaaaat 


gggggttgat 


gaggttatcg 


2520 


tctccaggca 


gaacgacggt 


tcgattcgtg 


cttttctgaa 


cgtttgtcgt 


caccgtggca 


2580 


agacgctggt 


acacgcagaa 


gcaggtaatg 


ctaaaggttt 


cgtttgcagc 


tatcacggct 


2640 


25ggggcttcgg 


cgctaacggt 


gaactgcaga 


gcgtcccgtt 


tgaaaaagaa 


ctgtatggcg 


2700 


aggcgctcga 


caagaaatgt 


atgggattga 


aagaagtcgc 


tcgtgtagag 


agcttccatg , 


2760 


gcttcatcta 


tggttgcttc 


gatgaggaag 


ccccttctct 


caaagactac 


atgggggacg 


2820 


ctggctggta 


cctggagcct 


atgtttaagc 


attccggagg 


gctagaactg 


atcggtcctc 


2880 


caggaaaggt 


cataatcaag 


gctaactgga 


aagcgcccgc 


ggaaaacttt 


acgggggatg 


2940 


30cgtaccacgt 


gggttggacg 


catgcgtctt 


cgcttcgctc 


agggcagtcg 


gtcttctcgt 


3000 


cgttagctgg 


caacgcagct 


ttgcccccag 


aaggtgcagg 


tctgcaaatg 


acctccaaat 


3060 


acggcagcgg 


catgggtgtg 


ttgtgggacg 


gatattcagg 


cgtgcacagc 


gcagacctgg 


3120 


ttccggaatt 


gatggccttc 


ggcggtgcta 


agcaggaacg 


gctgaacaaa 


gaaattggcg 


3180 


aggttcgcgc 


acgaatctat 


cgcagccacc 


tcaactgcac 


cgttttcccg 


aacaacagtt 


3240 


35ttctgacctg 


ctcgggtgtc 


ttcaaggtat 


ggcacccgat 


cgacgcaaat 


accactgagg 


3300 


tatggaccta 


cgccatggtc 


gaaaaagaca 


tgcccgagga 


tctcaagcgc 


cgcttggtcg 


3360 


acgcggttca 


gagaacggtt 


gggcctgctg 


gcttctggga 


aagcgacgac 


aacgacaata 


3420 


tggaaacggt 


atcgcaaaac 


gccaagaaat 


atcagtccag 


agatggcgat 


ctggtttcca 


3480 


acctgggttt 


cggcggggac 


gtatacggcg 


acgaggttta 


tcctggcatc 


gtcggcaaat 


3540 


40cggcgattgg 


cgagaccagt 


tatcgtggct 


tctatcgggc 


ttacggcgcg 


cacatcagca 


3600 



27 



gctctagctg 


ggctgaattc 


gaggatgtct 


ctaaaaattg 


gcataccgaa 


ctggcaaaga 


3660 


ctactgatcg 


ctaacagacg 


agagggacca 


tgatgattaa 


tattcaggaa 


gacaagcttg 


3720 


tctccgccca 


cgacgccgaa 


gagtttcttc 


gtttcttcaa 


ttccggcgac 


gaggctttgc 


3780 


aacaagaagc 


taccacgttg 


ctaacccggg 


aagcgcatct 


tttagacatt 


caggcttacc 


3840 


Sgcgcctggtt 


agagcactgc 


gtggactcag 


aggtgaaata 


tcagattatc 


tcacgcgaac 


3 900 


tgcgctcagc 


ttccgagcgc 


cgttaccagc 


tcaatgaaac 


catgaacatt 


ttcaacgaga 


3960 


attatgaaca 


actggaagtt 


cgcgtagcgc 


atcaactgga 


tccgcaaaac 


tggggcaata 


4020 


gtccaaaggt 


gcgctttact 


cgtttcatca 


caaatatcca 


ggctgcaatg 


gacgaaaatg 


4080 


s aagatttgct 


tcacattcgc 


tccaacctaa 


ttgttcaccg 


agcacgacgc 


ggcaatcaag 


4140 


lOtcgatgtctt 


ctatgccact 


cgggaggata 


aatggaagcg 


cggcgaagat 


ggagcgcgta 


4200 


agttggtcca 


acgattgatt 


gattatccag 


agcgcacatt 


ccagacgcac 


aatgtgatga 


4260 


tctttatgtg 


acccaataat 


cgcctttaca 


aatggtgact 


gctacaagcg 


gtcccattgt 


4 320 


tcaaaaggaa 


atttatgtgt 


atgagcaatc 


aacaagtcgt 


ttcgataacc 


ggtgctggct 


4380 


caggaattgg 


tctcgaactg 


gttcgatcct 


ttaagtccgc 


cggttattgc 


gtatccgctc 


4440 


15tcgtacaaaa 


cgaggagcaa 


aaggcgagcc 


tttgcaatga 


gttcaaggac 


gcactcgaga 


4500 


tcgtcgtggg 


cgatgtccgg 


gaccacgcaa 


caaatgagaa 


gctgataaag 


caaacaaccg 


4560 


atagattcgg 


ccatctcgat 


tgtttcattg 


caaatgccgg 


tatttgggat 


tacatgcttg 


4620 


gcatcgaaga 


gccttgggag 


aaaatatcga 


gcagttttga 


tgagatattc 


aacatcaatg 


4680 


tcaagagcta 


tttcagcggt 


atcagggccg 


ccctgcagga 


actgaaaaag 


actagcggat 


4740 


20cagtggtgat 


gaccgcttca 


gtgtcgtccc 


atgcggtcgg 


tgctggtggt 


tcttgctaca 


4800 


tcgccagcaa 


gcatgcggtc 


ctgggcatga 


tgaaagcttt 


ggcttacgaa 


ttggctcccc 


4860 


acattcgcgt 


caacgccgta 


gcaccgggcg 


gcactgtgac 


gcctctgagc 


ggtcccgcaa 


4920 


gcgccggctt 


cgacaaaact 


cacatggaaa 


acatgcccgg 


tatcgaggac 


atgatcaagg 


4980 


gtctaacgcc 


tcttggaatt 


tcagccaagg 


ccgaagacgt 


agtggcaccc 


tatttgttgt 


5040 


25tggcgtcgcg 


agatcaaggg 


aaattcatta 


ccgggactgt 


cattaatata 


gatggaggga 


5100 


tggcgctcgg 


tcgcaagtag 


gtttgtcgcc 


tatcttgaaa 


taataactaa 


atttctggta 


5160 


aaaccgcatg 


aatacaaaat 


tgtttatcaa 


caatgtctgg 


atcaattcca 


gtgaccaaca 


5220 


gaccttcgag 


cgaaagcacc 


ccgtcagtgg 


tgaggtgatg 


acggagtgtg 


caaactccac 


5280 


ggtgatggat 


gcgttaaagg 


ccgcgcaagc 


tgcccaagag 


gctttccaga 


cctggaagac 


5340 


30tgttggacct 


tcggagcgtc 


gccgccttct 


gctgagggtc 


gctgaggtta 


tggaaagtaa 


5400 


aacacccgag 


tttatcgaag 


tgatggccaa 


ggaggtggga 


gcctccgctc 


tttgggccgg 


5460 


cttcaatgtc 


cagatgtcag 


ccaatgtgtt 


ccgtgaagcg 


gcatcgctgg 


ctacacaaat 


5520 


tcagggggaa 


actattccga 


cagacaagtc 


tgacacgctc 


tcaatgacgc 


tacgtcagcc 


5580 


ggtcggtccg 


atcctgagca 


tcgtgccgtg 


gaacggcacc 


gcagtgctgg 


cggcacgagc 


5640 


35catcgcttat 


ccgctggtct 


gcggcaacgc 


ggtggtattc 


aaaggttctg 


agtttagtcc 


5700 


cgcgacgcat 


gccctgatca 


cccagtgcgt 


gcaggaagcc 


gggctgcctg 


ctggcgtgct 


5760 


caactatctc 


aactcttcgc 


ctgaccgttc 


gcccgagatc 


gccgacgcac 


tgatctcagc 


5820 


caaggagatc 


cgacgcatca 


acttcacggg 


ctccacccgc 


gtgggcagta 


ttatcgcgca 


5880 


gaaggccgcg 


caacacctca 


agcgctgcct 


gctggagctc 


ggtggcaagt 


ccccacttat 


5940 


40tgttctggat 


gatgcagaca 


tcgatgcggc 


ggtcaaggca 


gcggtgttcg 


gtagcttcct 


6000 
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gttccaaggt 


cagatctgca 


tgtccactga 


gcgcttgatc 


gttgatgaga 


agatagccga 


6060 


cgaatttgtc 


gcaaaa'tttg 


tcgaaaaaac 


taagcgcttg 


agcgcaggcg 


acccgtgcgt 


6120 


aactggcgac 


tgcatcatcg 


gcccgatggt 


ctcgccaaat 


tcgggtgagc 


ggatcaatgg 


6180 


tttgttcaaa 


gacgcgatcg 


acaaaggggc 


aaaagttgtt 


tgcggcggct 


tggcccaagg 


6240 


5tgcgctcatg 


ccggccacga 


tcctggatca 


cgtcaaatct 


gacatgcgga 


tttacgatga 


6300 


ggagaccttt 


ggtcccatca 


ccgtggtaat 


ccgttgtaaa 


ggcgaagcag 


aggccgtccg 


6360 


cattgccaac 


gacagcgtct 


atggcctgtc 


gtcgggcgta 


tttgggcgcg 


acatcaaccg 


6420 


cgctctacgc 


gtgggtatgt 


ccatcga.ata 


tggttctgta 


cacatcaacg 


gttcgaccgt 


6480 


ccagaacgag 


gcgcaggctc 


cttacggagg 


caccaagaac 


accggctacg 


ggcgcttcga 


6540 


lOcggccgtgct 


gtaatcgacg 


agttcacaga 


gatcaagtgg 


ctgaccatcg 


aacctttcga 


6600 


gcagcaatat 


cccttctgat 


aagcactaac 


tcccaggaat 


caaactatga 


gtaagcaagc 


6660 


tgcagttatc 


gagctcggat 


acatgggtat 


ctcggtcaag 


gaccctgatg 


cgtggaaatc 


6720 


atttgccacg 


gatatgctag 


gtctgcaagt 


tcttgatgag 


ggtgagaagg 


accgtttcta 


6780 


tctgcggatg 


gattactggc 


atcatcggat 


cgtagtccat 


cacaacggac 


aggacgactt 


6640 


15ggagtaccta 


ggctggcgtg 


tagccggcaa 


gccggagttc 


gaagctctgg 


gtcaaaagct 


6900 


tattgatgcc 


ggttacaaga 


tccgcatctg 


cgacaaagtt 


gaggctcagg 


agcgtatggt 


6960 


gttgggtctg 


atgaagacag 


aagatccggg 


cggcaacccg 


accgagatat 


tctggggccc 


7020 


ccggatcgac 


atgagcaacc 


cgttccatcc 


cggtcgcccc 


ctgcacggaa 


agtttgtgac 


7080 


cggtgaccaa 


ggcttgggcc 


attgcatcgt 


tcgccaaacc 


gacgtcgcag 


aagctcataa 


7140 


20gttttatagc 


ctgctgggct 


tccgtgggga 


cgtcgaatac 


cggattccgt 


tgcccaacgg 


7200 


catgactgcc 


gaactgtcgt 


tcatgcattg 


caacgcccgt 


gatcactcca 


ttgcttttgg 


7260 


tgccatgccc 


gctgccaaac 


gactcaatca 


cttgatgctt 


gagtacaccc 


atatggaaga 


7320 


cttgggatac 


acgcaccaac 


agtttgtaaa 


gaacgaaatt 


gacattgcct 


tgcagcttgg 


7380 


cattcacgcc 


aacgacaagg 


cgttgacgtt 


ctatggtgca 


acgccttcgg 


gctggctcat 


7440 


25tgagcccggc 


tggcgaggtg 


ccacggccat 


agatgaagcg 


gagtattacg 


tcggcgacat 


7500 


cttcggccat 


ggcgtggagg 


ccactggata 


tggcctggat 


gtaaaactga 


gctaaagatg 


7560 


cgcgctcgtt 


gggcgaggct 


ctagtccagc 


atcttcatac 


gcaaccaacc 


ttgcagggcg 


7620 


atgagatcaa 


aggacgttaa 


agcgaagggg 


aagtggttcg 


ggccatgcgc 


ataccgatcc 


7680 


atgacatttg 


tttcatagta 


tataggtaga 


taggtgaatc 


aagcgcttag 


tcaactagtg 


7740 


30gacacatctg 


ttccatgagg 


ctatctacta 


tctattcaaa 


acaagaatga 


taaataggat 


7800 


gaaaataata 


atgataaaaa 


gaacgatttg 


tcttgtgtat 


cctctattct 


gtttggcaag 


7860 


ccccacatgg 


gccgaagagt 


cgccttggac 


gtaccgtatt 


ggtatgacta 


atgtagcttt 


7920 


cgatgctagc 


gcaaaagtat 


acttaaatgg 


tcagcgggtg 


ccaggaggaa 


gcgctgatgc 


7980 


gagcgataac 


aacgcgctta 


cattcgactt 


cggctacgcc 


atcaacgacc 


agtggaatgt 


8040 


35acgtgcgatt 


gtcggtattc 


cgcctacaac 


taaagtgacg 


ggcgcaggca 


cacttcctgg 


8100 


tatccagctg 


gggaaaataa 


cttacgctcc 


aacagtatta 


acgttgaact 


ataacctccc 


8160 


cgctttgggt 


cccgttcgcc 


ctcacatagg 


tgcgggagtc 


aattacacgc 


ggatttttga 


8220 


aagtcgggac 


gctaatctaa 


aatcgttcga 


tgccgaccac 


gcttggtccc 


ccgcgctaca 


8280 


tgttggtgcc 


gatattgacg 


ttaaccgtgg 


ttggttcgtt 


agcattgata 


tccggaagtt 


8340 


40atacctgaaa 


accgacgcat 


cagggtactt 


ggggccacag 


gaggctaaag 


cacgggtaac 


8400 
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tcttgaccca 


ttactaactt 


cgatcgcgat 


cggacgccaa ttctgatgat 


tctgtttaaa 


8460 


gttctttatc 


tatctaaccg 


caaagggtgt 


ttccatgtcg aataaaatta 


tgaaaacgtc 


852 0 


gcgtcttacc 


gccgaagata 


tcaacggcgc 


ctggactata atgcccacac 


cctcgacgcc 


8580 


tgatgcttct 


gattggcgca 


gcactgccac 


cgtggactta gaagagactg 


cccgcatagt 


O £L A ft 


5tgaagagctg 


attgcagctg 


gtgtcaacgg 


tattctaagt atgggtactt 


ttggtgagtg 


fl*7 no. 


cgccacgttg 


acctgggatg 


aaaaacgtga 


ttatgtctcg acgattgtcg 


agaccattcg 


O / Ow 


tggtcgcgtg 


ccttatttct 


gtggcacgac 


agccttaaat acccgagaag 


tcatccgcca 


oo^u 


gacccgagag 


cttatcgata 


ttggcgccaa 


cggcactatg ctcggggtgc 


cgatgtgggt 


ft ft R 0 


gaagatggac 


ctgcctacag 


cggttcagtt 


ctatcgtgat gttgcagatg 


cggtaccaga 


R Q4 n 


lOggctgccatt 


gcgatttacg 


ccaaccccga 


agcattcaag ttcgacttcc 


ctcgcccatt 




ctgggcagag 


atgtccaaaa 


ttccgcaggt 


agtgactgcg aagtatctag gcatcggaat 


7UOU 


gcttgacttg 


gacctgagac 


tggcacccaa 


catccgcttc cttccccacg 


aagatgacta 




ttacgcggcc 


gcacgcatca 


atcccgagcg 


cataaccgcg ttctggtcaa gcggggccat 


Q1 OA 


gtgcggcccg 


gctaccgcca 


tcatgttgcg 


tgacgaagtg gtgcgggcca 


agagcaccgg 


3240 


IStgactgggcc 


aaggccaaag 


ccatctccga 


tgatatgcgt gcagccgact 


cgacattgtt 


9300 


tccgcgtggc 


gacttttcgg 


agttctcgaa 


gtataatatc gggcttgaaa aggcacggat 


9360 


ggatgcggct 


ggttggctca 


aggctgggcc 


ctgccgtccg ccctacaacc 


ttgttccaga 


9420 


agactacctc 


gctggtgcac 


agaaatcagg 


caaggcttgg gccgcgctgc 


acgctaaata 


9480 


cagtaatgaa 


ttgaagtagt 


tcacctccgc 


agacctgagt gacagggtgg 


cgcagacgct 


9540 


20gagggtgcag 


gaattaagtg 


agctaaagca 


catttcttgc gccaggcatt 


gccagatcag 


9600 


caaagtttgc 


tgatctggca 


gtttcaaaaa 


tttgggcgaa agctgatatc 


aggaatacgg 


9660 


gataaaggca 


gtgcaccata 


acgacggggg 


cgtgccattc gtgatgaacg 


attttgctat 


9720 


tgtgccgact 


tctgttcttg 


gagtgtttga 


ttgtgattgt cgatttttat 


ttcgattttt 


9780 


tgagtccgtt 


ctcttacttg 


gccaaccatc 


gtttgtcaaa gcttgcgcaa gactatggct 


9840 


25tttccattcg 


ttattacgca 


atcgatttgg 


cgcgagttaa aatagccatc ggaaacgttg 


9900 


gtccatctaa 


tcgcgacctg 


atagtcaagc 


tggactattt gaaagtagat 


ttgcaacggt 


9960 


gggccgagct 


ttacgaaata 


ccgttggtat 


tcccagctaa ctacaacagc 


cgacggatga 


10020 


atactgggct 


ttattactcg 


ggagccatgg 


cacagactgg tgcctatgtg 


aatgtagtat 


10080 


ttaatgcggt 


ttggggagat 


ggcatagctc 


cagatttgga aagcttgcct 


gctctggtat 


10140 


30ctgaaaaact 


aggctgggat 


cgtagcgcct 


tcgaggactt tatcagcagc 


gatgccgcaa 


10200 


cagagaggta 


tgacgagcag 


acacatgccg 


cgatcgaacg caaagtgttc 


ggtgtgccaa 


10260 


cgatgttttt 


gggcgatgaa 


atgtggtggg 


gaaacgaccg tctatttatg 


ctcgagaacg 


10320 


cagtgggagg 


tgcgcctgta 


aatggagaat 


agtcgctacg gagcgcttgt 


gccggctaaa 


10380 


tgccgatata 


agtggttgac 


ctgatcgtta 


tttgctcgat acagcgcttt 


caaaatcagc 


10440 


35ggctactgaa 


gtcagataaa 


aatgcgggac 


tacttcaggc atcctgtgcg 


acacaaagtt 


10500 


ttacctgtaa 


ttgtccacct 


attccgagtt 


tggaatggta gctgactcgc 


tatgcgacca 


10560 


gcgatagcct 


aacaagacat 


gcatcactgg 


taacggtggg gtgtgaagct 


cctgcaacaa 


10620 


tgtagcccct 


tgatgtgtgt 


atttgctgcg 


aggtgaagca cagatgctcg gagccgtacc 


10680 


ggcttgtggc 


gctaggctgg 


caagtatgag 


caacgtaagt gggggttggg gcgcaatggg 


10740 


40aaccaaaaac 


caacgcaagc 


cttaccagcg 


tcgttcggtg ccttcctccc 


atgcctccgc 


10800 
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ctcgataaag 
ggtcttgtcc 
cgaccgcttc 
ccgaccgcgc 

5ccgctcgacg 
gctcttctgg 
cttggagatg 
ggtctccgcc 
gatccggcct 

lOgcgctcgaac 
tcgttcaggc 
tgcttgagcg 
acatgggcta 
tgatagatgc 

IStcccgagtgc 
ttagtaatag 
gcgtccagcc 
cggctgttag 
ccgccgagtt 

20aaacagaccc 
ttgattgcct 
ctgacggcaa 
atggaggaag 
cgcattccgg 

25cctaccggca 
tacaggtgtg 
aagcgaacaa 
gcccgcgcgc 
gcatgcctcg 

30gacgcgggcg 
ggacggcatc 
gggcgttacc 
catccggcgc 
cgacaaggtg 

35 



cagctgcgca 

agcgcctcgt 

atgccgtagg 

atgaacttgc 

gttgactctg 

tacgtcccga 

tgcaacttct 

gccatggcgg 

cgagtccgtg 

tgaacaggtt 

tctgcaagcc 

agcgaccgag 

gcaacttgtc 

gttcgcaccc 

tcacccgcgt 

ttgaactttg 

gccccacctc 

ggaaactctg 

ttccaatgaa 

gcaaagagct 

tgatcgatcc 

tgctgcgggt 

cgctgtacga 

actaaaccac 

tcatgtttgc 

gcatgacacg 

tccgcaaacg 

gcattggtct 

accacatggc 

gcgacgacct 

agcgaagtcc 

tcgcgccaga 

gaaatgtcgg 

accaggatca 



tatcggcttc 

cgctccgata 

gctccaggca 

tgttgttcac 

ccggtgacgc 

tctcgacgcc 

cgccgatggc 

tggcctcggc 

ccgccgacag 

gaacaccagg 

gactttgcgt 

gcgatccagc 

caactccggc 

ggcctgtttc 

ataaccgatc 

attaagctta 

tactcagctt 

aagaagactt 

gcagatgacc 

gttcctgatc 

gcattacccc 

tcatctgatg 

gacgaccatc 

catccccaac 

tgaattttcg 

ccaaagaaat 

ttaaggctac 

ccttccgtcc 

cgtggtccgt 

cgggttgttg 

cgagcggggt 

tgagtccgtc 

gcacggcgcg 

gcgtcgac 



ctggctgatc 

cggaatggtc 

gtagcgctcg 

caccgccagg 

gatattgcgc 

acggtggcgt 

gctgacgctc 

cttggctggc 

gcccgcctga 

cgatcttggg 

gcagccagct 

ttccagatca 

cgcgcgcttt 

agggcatcga 

ttcataaaaa 

ccagttattt 

gatgaaccga 

cctgattttg 

ttcctcgacg 

gagatggatc 

aagggtgagg 

cataagtggt 

ctgcgccagt 

ttccgccggc 

cctttggctt 

caaccttggc 

caggaaaaac 

aagagtgtca 

tcatctgttg 

gcgttcggcc 

gagcccgtag 

gcgttcgagg 

ggccagcgca 



tcggttagta 

agctcgtaat 

atgttctccg 

cgcagggtga 

ttttgacctc 

aggtagctgt 

aggcggccct 

aggcccttgg 

gtccgctcgc 

cgtgggtgct 

agccgaccaa 

ccacggcatc 

ttgcgccgct 

cctggaggtc 

gtaccgttta 

gaaccctagc 

ggggcagtag 

gcaaaatgcc 

ccgagtatgc 

gggtggtgcc 

gtggcccgcc 

tcggctacag 

ttgccgggtt 

tgctggagaa 

tgagtgcggc 

ggctagaaca 

ccaggatgcg 

aaggatattg 

gccatacgtc 

cagtcgatga 

cgcacagatg 

tgacgcagcg 

ttgaatcgtt 



ggtcatgcaa 

ggccggtctc 

tggcccgctt 

cggtggccac 

gcgccagggc 

acagggtgct 

cgcggtacag 

gacggcgacc 

ggatcagctc 

gtcaatgggg 

ctcaaccagg 

gcccgctcga 

ggcgatgtct 

ggcgttctaa 

cttgactgca 

gcaggtgtaa 

ttcatcaatc 

cggatttcac 

cggtaagcgc 

gtggaagggt 

agcctatcca 

cgacccggcg 

gagcctggag 

acacgaacta 

attcaaccga 

cttgtcgatg 

gtagccgata 

gactaatgca 

tatcgaatcg 

gctcagtcag 

ggggtgtcga 

tctcggtcag 

gcggaccctg 



10860 

10920 

10980 

11040 

11100 

11160 

11220 

11280 

11340 

11400 

11460 

11520 

11580 

11640 

11700 

11760 

11820 

11880 

11940 

12000 

12060 

12120 

12180 

12240 

12300 

12360 

12420 

12480 

12540 

12600 

12660 

12720 

12780 

12808 
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<210> 11 
<211> 15024 
<212> DNA 
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<213> Artificial Sequence 
<220> 

<223> A modified DNA sequence 

5 



<400> 11 



atcgtcctta 


agcgccgcac 


gcagccgttt 


aagcgaox eg 


t. uy a^-yy *-«y 


actacatcacr 


60 


ccccagtttt 


tcgccggccg 


tcgatacgct 


ccggtcgayc 


ciy LayL Lyy u 


taaaaaccac 


120 


cagcagattc 


aagtcgatgt 


cgcgcagatc 


cacgatgccu 




pa t-octaata 


180 


lOattttaacta 


tcagacttga 


tctatagcgc 


tataccgatc 


gacgcgccdy 


ctct u. t»yua.y v_ i_. 


240 


attcggagac 


aactgaaaaa 


agagcttgca 


tggaactggt 


fit - n ft ^ a f*f> 
ay C cly aa. L. v- k~ 


f« +- /-> ;a ;3 f» H t-rir* 
L. da. l. l, i— y 


300 


atctgaacgc 


ggagaccggc 


agcaccctgc 


ttgacgtgct 


cayy uCLaaL 


f» a fTfr t" /"* f /"* 3 
y dyy l. ^ v*> ^ w o 


360 


tttcttatag 


ctgcatgtcg 


ggccgctgcg 


gcacttgccg 


tegcegt-gug 


s ^ t~ ft f» ft ft rip f* 

dtuy^LyyoL 


420 


atcttcgcga 


taacggcccc 


gagacagggc 


gcccgcaggc 


— » 4 ftftfl ^ 

aggaaaggg o 


-Jf»f1f- a ftfT f- 1" 
aLLLd^y LL^ 


480 


15tggcctgtca ggcggttctg 


accgaagact 


gcacgatcga 


gat tcctgaa 


¥~ f¥ ^ ft ft 3 ft 3 

cctgacg&yci 




tcgtggttca 


cccggcgcgc 


atcgtcaagg 


ggaeggtcac 


agegatagae 


gaagccaccc 


£ ft A 


atgacatccg gcgcctgcgc 


atcaaactgg 


ccaaaccgct 


tgagttcagc 


cctggccagt 


ooU 


acgcaacggt 


gcagttcacg 


cccgaatgcg 


tccgcccata 


ttcgatggcc 


gggctgecta 


oon 


gcgatgcgga 


aatggagttt 


cagattcgcg 


cggttccggg 


cgggcatgtc 


agcaactacg 


/ oU 


20ttttcaatga actgtccgta 


ggcgcttcgg 


tgcggatcag 


cggccccctc 


ggaaeggett 


o a n 


atctgcggcg 


cacgcacacc 


ggccccatgc 


tttgtgtggg 


gggtggaaca 


ggtctggcgc 




ccgtcctttc 


gatcgttcga 


ggcgcactgg 


aaagegggat 


gagcaacccc 


atccatctgt 


you 


acttcggtgt 


gcggagcgag 


caggacatct 


atgacgagga 


acgccttcac 


gcattggctg 




caaggtttcc 


gaatctcaag 


gtgaatgtcg 


ttgttgcaac 


aggccctgcc 


ggccctggtc 


1 A Q A 


25gtcgatccgg cctggtcacc 


gatctgatcg 


geegtgaett 


geccaatttg 


gcgggatggc 


1140 


gcgcctacct 


gtgtggcgct 


ccggccatgg 


tcgaggccct 


gaacctgctc 


gttgctcgcc 


1 O A A 


taggcatagt 


acccgggcac 


atccatgccg 


atgcgttcta 


^ ft ft a fi ftftft ft 

ccccdgcygc 


ft 4" ri^/ia ft f»fr zi 

y ttLyayLya 


1260 


aggcaccatg 


cgaacccaat 


tcaacccaag 


gataccaagc 


catgagtgaa 


ccccaacgat 


J. J z u 


taaaacccgt 


gtttccccaa 


gatccgaaat 


ggcegggega 


aggtagcagc 


cgcgttccct 


1380 


30tctgggccta 


cacccgcgaa 


gacctgtaca 


agegegaatt 


ggagcgcctg 


ttctatgcaa 


1440 


accactggtg 


ctatgtaggc 


ctggaagccg 


agattccgaa 


tccaggcgac 


ttcaagcgaa 


1500 


cggtgatcgg 


tgagcgctcg 


gtcatcatgg 


tgcgtgatcc 


ggatggcggc 


atcaacgtgg 


1560 


tggagaacgt 


ctgcgcccac 


cgtggcatgc 


gettttgecg 


cgagcgccac 


ggcaacgcca 


1620 


aggacttctt 


ctgcccctac 


caccagtgga 


actacagcct 


caagggtgac 


ctgcagggcg 


1680 


35tgcccttccg 


ccgtggcgtc 


aagcaggacg 


gcaaggtcaa 


eggeggcatg 


cccaaggact 


1740 


tcaaactcga 


agaacacggc 


ctgaccaagc 


tcaaggtggc 


cgcccgaggc 


ggtgcagtgt 


1800 


ttgcctcttt 


tgaccacgat 


gtcgagccct 


tcgaggagtt 


cctgggccca 


accatcctgc 


1860 


attacttcga ccgcgtcttc 


aacggccgca 


agctcaagat 


cctgggctac 


cgccgccagc 


1920 


gcatcccggg caactggaag 


ctgatgcagg 


agaacatcaa 


ggacccctac 


cacccgggcc 


1980 


40tgctgcacac 


ctggttctcg 


accttcgggc 


tctggcgcgc 


cgacaacaag 


teggaactga 


2040 
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agatggacgc 


caagttccgc 


cacgccgcaa 


tgatctccac 


gcgcggtcag 


ggcggcaaga 


2100 


acgaggaggt 


cgtgtccggc 


gtggacagct 


tcaaggaaca 


gatgaaggtg 


aacgacccgc 


Z 1 O \J 


gcctgctcga 


catcgtgccc 


gagccctggt 


ggg9 c 99 tcc 


gactgcggtg 


acgaccacga 


2220 


tcttccccag 


cgtgatcatc 


cagcagcagg 


tcaacagcgt 


atcgacccgc 


caca tccagc 




Sccaacggtca 


cggctccttc 


gatttcgtct 


ggacccactt 


cggcttcgag 


gacgacaacg 


234 0 


aggagtggac 


ccagcgccgc 


ctgatccagg 


ccaacctgtt 


cgggccggcg 


ggcttcgtgt 




cggccgatga 


cggcgaggtg 


atcgagtggt 


cgcaggaagg 


ct ttgagcaa 


aaaccgacgc 


Z ft O U 


accgcaccgt 


gatcgagatg 


ggcggtcacg 


aaatcggcga 


cacggaccac 


atggtcaccg 


z dz u 


agacgctgat 


ccgcggcatg 


tacgactact 


ggcgcaaggt 


gatgggggaa 


taaacatggt 


2580 


lOcgacttcaaa 


acctatttcg 


aactgctgaa 


cctgtacagc 


gactacgcca 


tggtgtgcga 


2640 


ctccgccaat 


tgggagaagt 


ggcctgattt 


cttcatcgag 


accggcacct 


accgcctgca 


2700 


gccgcgcgag 


aactttgagc 


agggcctgcc 


gctgtgcctg 


ctggcgctgg 


agagcaaggc 


2760 


aatgatccgc 


gaccgggtgt 


acggcgtcaa 


ggaaaccatg 


taccacgacc 


cctactacca 


2820 


gcgccacatc 


gtgggcacgc 


cgcgcgtgct 


gtcagtggag 


cgtgacgcgg 


atggcgagcg 


2880 


15catcaccgcc 


gaagccagct 


atgccgtgat 


tcgcaccaag 


tacgacggcg 


attccacgat 


2940 


tttcaacgcc 


ggctattacc 


gagacgtgat 


cgtgcgcacg 


cccgagggcc 


tcaagctgaa 


3000 


gtcgcgcctg 


tgcgtttacg 


acagcgaaat 


gatccccaac 


tctgtgatct atccaatctg 


3060 


aggcgacgca 


catgactcag 


aactggattg 


atgcagcttg 


tcttgacgac 


atccctgaag 


3120 


gcgatgtggt 


cggcgtcaaa 


gttaacggca 


aggaaattgc 


gctctacgag 


gtcgaggggg 


3180 


20agatttatgc 


caccgataac 


ctgtgcacgc 


acggcgctgc 


gcgcatgagc 


gatggctttc 


3240 


tcgaaggccg 


ggaaattgaa 


tgccctctgc 


accaaggtcg 


atttgatgtc 


tgcacaggca 


3300 


aagccttgtg 


cacccccctg 


acaaaggaca 


tcaaaaccta 


ccccgtcaaa 


attgaaaaca 


33 60 


tgcgcgtgat 


gctcaaaatg 


gagtaagact 


gctttcaatc 


aggaggctga 


attcggacca 


3420 


atcaaccaaa 


ccatccaatc 


cacccattag 


ccaaaaaacg 


gagacattaa 


tcatgattta 


3480 


25tgaaaatttg 


gtgagtgaag 


cagggctgac 


gcaaaagcac 


ctgattcatg gcgacaaaga 


3540 


acttttccag 


cacgaattga 


agaccatctt 


cgcgcgaaac 


tggctttttc. 


tgacccatga 


3600 


cagcttgatt 


ccctcccccg 


gcgactatgt 


cacagccaaa 


atgggtgtcg 


atgaagtcat 


3660 


cgtctcccgc 


cagaacgatg 


gctcggtgcg 


agcctttttg 


aatgtttgcc 


gtcaccgggg 


3720 


caagacacta 


gttcacgctg 


aagccggaaa 


tgcgaaaggc 


tttgtgtgca 


gttaccacgg 


3780 


30ctggggcttt 


ggctccaacg 


gcgaactgca 


aagcgttccc 


tttgaaaaag 


agttgtacgg 


3840 


cgatacgatc 


aaaaagaagt 


gcctgggctt 


gaaagaagtc 


ccccgcatcg 


aaagctttca 


3900 


tggcttcatt 


tacggttgtt 


ttgatgcaga 


ggcccccacg 


cttgtcgatt 


atctgggtga 


3960 


tgcagcctgg 


tacttagaac 


ccatcttcaa 


gcactctggt 


ggcctggaac 


ttgtaggccc 


4020 


ccccggcaaa 


gtggtgatca 


aggccaactg 


gaaggctcct 


gcggaaaact 


ttgtgggtga 


4080 


35cgcgtaccac 


gttggttgga 


cgcacgcatc 


gtctttgcgc 


tcaggtcagt 


cgatatttac 


4140 


ccctcttgcg 


ggcaacgcta 


tgcttccacc 


cgaaggcgcg 


ggcttacaaa 


tgaccagcaa 


4200 


gtatggcagt 


ggaatgggcg 


tattgtggga 


cggctactcc 


ggtgtccaca 


gtgctgacct 


4260 


ggttcccgaa 


atgatggcat 


tcggcggcgc 


aaaacaggaa 


aaactcgcca 


aggaaatcgg 


4320 


cgatgtccgg 


gcacggattt 


accgcagcca 


tctaaactgc 


acggttttcc 


cgaacaacag 


4380 


40cattttgacc 


tgctccggtg 


tcttcaaggt 


ctggaacccg 


atcgatgaaa 


acacgaccga 


4440 
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ggtttggacg 


tatgccatcg 


tagaaaaaga 


catgcctgag 


gacttaaagc 


gtcgcttggc 


4500 


tgacgcggtt 


cagcgCactg 


tcggaccagc 


aggattctgg 


gaaagcgacg 


acaacgacaa 


4560 


catggagacg 


gagtcgcaaa 


atgccaagaa 


ataccaatcc 


agcaacagtg 


atctgattgc 


4620 


caatttgggt 


ttcggcaagg 


acgtctacgg 


cgacgaatgc 


tatccgggcg 


tcgttgccaa 


4680 


5atcggcaatc 


ggcgaaacca 


gctatcgcgg 


attctaccgt 


gcctaccagg 


ctcacatcag 


4740 


cagctccaat 


tgggccgagt 


tcgaaaacac 


ctcccgaaat 


tggcacaccg 


aactcaccaa 


4800 


gacgactgat 


cgctaatcca 


ggagccaacc 


atgatgatca 


atacccagga 


agacaagctg 


4 860 


gtctccgcgc 


acgacgccga 


agaatttcat 


cgtttcttcg 


tcgggcacga 


cagcgatctg 


4 920 


cagcaagaag 


tcaccacact . 


cctgacccga 


gaagcacatc 


tgttggacat 


tcaggcctac 


4980 


lOaacgcctggc 


ttgaacactg 


cgttgccccc 


gagatcaaat 


accaagtgat 


ctcgcgagaa 


5040 


tttcggtcca 


cttccgagcg 


tcgataccaa 


ctgaatgatg 


cggtgaacat 


ctacaacgag 


5100 


aactatcagc 


acctgaaagt 


tcgagttgag 


catcagatgg 


acccgcagaa 


ctgggccaac 


5160 


agcccgaaga 


tccgtttcac 


ccgcttcgtc 


accaatgtca 


cagcggccaa 


ggacaagatc 


5220 


gtaccggatc 


tgctgcatgt 


gcgttccaac 


ctcattctcc 


accgcgccag 


acgcggcaat 


5280 


IScaagttgacg 


tcttctatgc 


aacgcgggaa 


gacaaatgga 


aacgcatcga 


aggtggtggc 


5340 


atccaactgg 


tggaacgtct 


tgtggactac 


ccggagcgca 


ttctccagac 


ccacaatctg 


5400 


atgaccttcc 


tgtgaaccct 


ggggatgcct 


gccttgatgg 


cggtcatcct 


tgattgtttt 


5460 


aaacagaaat 


ttattgccat 


gaacatacag 


caagttattg 


ccattactgg 


cgccggttca 


5520 


ggcatcgggc 


tggaactggt 


tcgatccttc 


aaagcagctg 


gctattgcgt 


gtccgcactt 


5580 


20gttcgcaacg 


aggaacaaga 


ggcgggcctt 


cgcagtgaat 


tcaaagatgc 


cattgagatc 


5640 


gtagcgggcg 


atgtttgtga 


tcacgccacc 


aatgaaaagc 


tggtcaacaa 


ggcggtcgcc 


5700 


aggttcggac 


acctcgactg 


cttcatcgga 


aatgccggga 


tatgggatta 


catgctgggc 


5760 


gtcgacgagc 


catgggagaa 


attatccggc 


agtttcgagg 


agatatttga 


tatcaacgtc 


5820 


aaaagctatt 


tcagcggcat 


cagcgcggcc 


ttgccggaac 


tcaaaaaaac 


gaacggatcg 


5880 


25gtcgtagtga 


cggcttccgt 


ttcttcctat 


gcggccggcg 


gcggcggctc 


ttgctacatt 


5940 


gccagcaagc 


atgcagtgct 


gggtatggtc 


aaggcgttgg 


cctacgaatt 


ggccccgcac 


6000 


atccgggtca 


atggcgttgc 


gccaggtggt 


acggtcactt 


ctttggctgg 


gccggcaagc 


6060 


gccggtttcg 


acaaaaccaa 


aatgaaagac 


atgcccggca 


tcgatgacat 


gatcaagggc 


6120 


ctgaccccct 


tggggttcgc 


agcaaggccc 


gaggacgtgg 


tggcaccgta 


tcttttgctg 


6180 


30gcctcccggg 


aacaagggaa 


gttcatcact 


ggcaccgtaa 


tcggcattga 


tggcggcatg 


6240 


gcgctcggtc 


gaaagtgaat 


tttcaatcaa 


atcagatttt 


tcaaccccat 


tcccaggaga 


6300 


caacccatga 


agacgaaatt 


gttcatcaac 


aacacctgga 


gcgcttcgag 


tgacaaaaag 


6360 


tcattcgatc 


gcaagcaccc 


tgtcagtggc 


gaggtcgtga 


cccaatgcgc 


gaacgccacg 


6420 


gtggacgatg 


cggtcaatgc 


ggctcgagcc 


gctcaagagg 


cgttcaagtc 


ctggaaggcc 


6480 


35gtcggaccct 


cggagcggcg 


gcgccttctt 


ttgaaggtgg 


cagacgtcat 


ggagagcaaa 


6540 


acgcccgagt 


tcatcgaagt 


gatggccaag 


gaagtgggag 


cctccgcgct 


gtgggcgggg 


6600 


ttcaacgtgc 


acctgtcggc 


caatgtattc 


cgggaagccg 


cctcactggc 


cacccaaatt 


6660 


caaggcgaaa 


ccattccgac 


ggacaagcct 


gacaccctgt 


caatgacgct 


gcgtcacgct 


6720 


gtcggcccca 


tcttgagcat 


cgttccctgg 


aacggcaccg 


ccgtgctcgc 


ggcgcgggcc 


6780 


40atcgcttatc 


cgctggtctg 


cggcaatacc 


gttgtgttca 


aaggctccga 


gttcagcccc 


6840 
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ggtacgcacg 


ctttgatcac 


caagtgcctg 


caggaggccg 


acctgcctgc 


tggcgtgctc 


6900 


aactatctga 


actcctcccc 


ggaccggtcg 


cccgatattg 


cagatgcgct 


gatttcgtct 


6960 


aaagagattc 


gtcgcatcaa 


cttcacaggc 


tccactcgcg 


tggggcgcat 


catcgcccag 


7020 


aaatcggccc 


agcatctcaa gcgctgcttg ctggagttgg 


gtggaaagtc 


cccgctgatc 


*7 Ci a A 


Sgttctggacg 


acgctgacat 


cgacgcggca 


gtcaaggccg 


cggtgttcgg 


cagcttcctg 


*7 i a n 
/ L*i U 


ttccaaggcc 


agatctgcat 


gtccaccgaa 


cgcctggtgg 


tcgacgaaaa 


gatcgcggac 


7200 


gaatttgtcg 


cgaagttcgt 


cgagaaaacc 


aagcagttga 


gcgcaggcga 


tccatgcgtc 


7260 


acaggggact 


gcatcatcgg 


cccgatggtg 


tcgcccaact 


cgggtgaccg 


aatcaatggt 


7320 


ctgttcaaag 


atgccatcag 


caagggcgcc 


aaggtcgtgt 


gcggcggcat 


cgccgagggt 


7380 


lOgcggtcatgc 


ccgccacgat 


cttggaccac 


gtgacagccg 


acatgcagat 


ctacgatgag 


7440 


gaaaccttcg 


gtcccatcac 


tgtggttatc 


cggtgcaaga 


gcgaagcgga 


cgccatccgc 


7500 


attgccaatg 


acagcgccta 


cggcctgtca 


tcgggcgtgt 


ttggccggga 


cgtgaaccgg 


7560 


gctctgcgcg 


tgggcatggc gatcgaatac ggctcggtcc 


atatcaacgg 


ctccaccgta 


7620 


cagaacgagg 


ctcaggcgcc 


ttatggcgga 


acaaaggcca 


ccggttatgg ccgcttcgac 


7680 


15ggacgcgcgg 


tgatcgacga gttcacggaa ctcaagtggc 


tgaccattga accattcgag 


7740 


cagcagtatc 


ccttctaagc 


tgaagcaaca 


aaggagttaa 


accatgaaca 


agccagcaac 


7800 


tgtcattgaa 


ttggggtaca 


tgggcatttc 


ggtcaaggat 


cccgcagcgt 


ggaaatcctt 


7860 


tgccgcaaac 


atgctgggac 


ttcaagtcct 


cgatgagggt 


gacaaggatc gcttctatct 


7920 


gcgaatggac 


aattggcacc 


atcggatcgt 


ggttcatcac 


aacggtcaag 


atgaccttga 


7980 


20atacctgggc 


tggcgtgtcg 


ccggtcaacc 


ggaattcgag 


gcattgggtc 


aaaagctcgt 


8040 


ggacgcaggc 


tacaaagtcc 


gcgtgtgcga 


caaagccgaa 


gcacaagaac 


ggatggtgct 


8100 


gggcctgatg 


aagacagaag 


atccgggggg 


caacccgacc 


gagattttct 


ggggaccccg 


8160 


gattgacctg 


aacaacccct 


tccatcccgg 


tcgtcccttg 


cacgggaaat 


ttctaaccgg 


8220 


tgatcagggc 


ctgggccact 


gcatcgtgcg 


tcagaacgat 


gttgaagcgg 


cacgtaagtt 


8280 


25ctatagcttg 


ctgggatttc 


gtggagatgt 


cgagtaccgc 


cttcctttgc 


ccaacggcat 


8340 


gacggctgag 


ttgacgttca 


tgcattgcaa 


tgctcgcgat 


cattcoatcg 


ctttcggtgc 


8400 


aatgcctgcg 


gccaagcgcc 


tcaatcatct 


gatgattgaa 


tacactcata 


tcgaagattt 


8460 


gggttgcaca 


caccagcttt 


tcacgaagga 


aaagattgac 


attgccttgc 


aattgggcat 


8520 


ccattccaac 


gataaggcgc 


tgacgttcta 


cggggcaaca 


ccttccggct 


ggctgataga 


8580 


30acctgggtgg 


cgaggcgccc 


ccgccattgc 


tgaatcggaa 


tattacgtcg 


gcgacatttt 


8640 


cggccacacc 


atcgaggcca 


ccggttatgg 


attggacgtc 


aaactgagct 


agccatgtaa 


8700 


cagatgcgaa 


atcgatcgca 


tctgttttcc 


ccattcagtt 


ccatataaaa 


aaaaggagac 


8760 


aatgatgatc 


aaagaagcca 


tttccctcgc 


aggacttggg 


atgctgatgc 


tcagcaccgc 


8820 


atatgccgag 


gattcccgtt 


ggtcatatcg 


catcggcgcc 


accaacgtag 


ccttcgatgc 


8880 


35gagcgccaaa 


gtttcgattg 


acggaacaag 


ggtgccgggt 


ggaagcgctg 


acgccagtga 


8940 


caacaacgct 


ttgacatttg 


acttcggtta 


catcattaac 


gataactgga 


acgcgcgatt 


9000 


aattgttggc 


attccaccca 


ccacaaaagt 


gacgggcgca 


ggcacgctgc 


ctccgatttt 


9060 


gctgggccgt 


gtccaatatg 


ctcctgcagt 


tttgtcggcg 


acctacaacc tgccacagat 


9120 


gggattggtt 


cgcccgtacg tgggggcggg gatcaactac 


actcgaattc 


tgaaaagcaa 


9180 


40agatgccaat 


ctgacctcat 


tcgatgcaga 


tcatgcgtgg 


gcacttgtgc 


tccacatcgg 


9240 
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tgcagaagcg 


aacatcaacc 


gcgactggtt 


eg c c age tct 


y Ct LaL^^vjcia 


aactttatct 


9300 


aaaaacagat 


gcatcggggt 


1 1 c t tgggc c 


LCaagttyCt 




taacattaaa 


9360 


tccgctgctg 


acgtcgattg 


cgattggccg 


nana t* \~ t~* t~ n £i 
y d.y d u uya 


t~ c aa t~ r* c a c a 

»- y y c w w a \w ca 


fctgattccaa 


9420 


aattctgttt 


gcatcaccat 


tttcaaagga 




aLcictydciayci 


coaacaaaac 


9480 


Sggtgcgcctg accgccgcgg 


atattcaagg 


cgcatgggtc 


ateatgeega 


ccccgtccac 


9540 


gccggatgcc 


tcggactggc 


gcagcacgca 


caeggtcgat 


ctcgacgaga 


cggcccggat 


9600 


tgtcgaggag 


ttgattgcgg 


ccggcgtcaa 


tggcattctg 


agccacggca 


ectttggega 


9660 


atgcgcgacg 


ctgacgtggg 


aggagaagcg 


ggattttgtt 


teaaeggteg 


tggaaacege 


9720 


gcgcggtcga 


gtgccctact 


tctgcggcac 


aacggccttg 


aatacccgtg 


aagtcatacg 


9780 


lOccagacccgc 


gaattgatcg 


acattggcgc 


ccaaggaaca 


atgctcggcg 


tgccgatgtg 


9840 


ggtgaagatg gatctgccta 


ctgccgtgca attttatcgc gatgtggcgg aagcagtgcc 


9900 


agatgcagcc 


atcgctgtct 


acgccaaccc 


ggaggctttc 


aaatttgatt 


ttcctcgccc 


9960 


gttttgggcc 


gaaatgtcca 


aaatcccgca 


ggttgtcaca 


gecaagtact 


tgggcategg 


10020 


gatgctggac 


ttggatctga 


aattggcccc 


aaatattege 


ttccttccgc 


atgaggatga 


10080 


ISctactacgct 


gcggcccgga 


tcaatcccga gcgcatgact 


gctttctggt 


etageggtte 


10140 


catgtgcggc 


ccagcgaccg 


cccttgtgct 


gegegatgag 


gtggtaaagg 


ccaaaaatac 


10200 


aggtgattgg 


gccaaggcca 


aggctatctc 


agatgacatg 


cgcgcagccg 


atgccacact 


10260 


gtttccacgc 


ggcgatttct 


cggaattctc 


aaaatacaac 


attggecteg 


aaaaagcacg 


10320 


aatggacgag gccggctggc 


tcaaggcggg 


gccgtgccgg 


ccaccctata 


cgctggttcc 


10380 


20cgacgaatac 


cttgcaggtg 


cccgaaaatc 


aggcaaggee 


tgggccgcac 


tgcataccaa 


10440 


gtatgccaag 


gaattgagga 


aaaccaaaac 


ggcaaccaac 


tcgaaaaaga 


agtaagtcca 


10500 


ggccctgagt 


cagacatctc 


cgatcagcac 


aacctgetga 


tctggaggtt 


ttctggatta 


10560 


gtgcagtcgg 


cgagttaaaa 


atatgccagt 


acagagtgga 


agcgccacaa gcgcagggtg 


lU D Z V 


caccgatcgc 


cctgaagcaa 


tctcacacta 


ttgecatett 


ctctttgttg ggagtgcatg 


1UOOU 


25atcgtgatgg tcgattttta 


tttcgatttt 


ttgageccat 


tttegtatet 


ggccaaccac 


10740 


cgtttgtcgg 


tgctcgccgg 


gcgttatgga 


ttctccatcc 


agtatcaege 


cattgatttg 


10800 


gcgcgagcaa 


aaacggccat 


tggcaacatc 


gggccatcca 


ategggaect 


caaggtcaag 


10860 


cttgactact 


taaaggtgga 


tttgcagcga 


tgggecgate 


tctataggat 


tccgttggtt 


J. \J y £t\i 


ttccccccta 


acttcaacag 


ccgccgggtg 


aatgeeggae 


tgtattaccc 


ggcagecagg 


10980 


30gagcgagccg ctgaatatgt 


tcgccttgtt 


ttcgattegg 


cttgggggaa 


agggtgggca 


11040 


ctggatgctg 


atagcttgct 


ggctgaggta 


tgegacaage 


taaactggga 


tctcggtgaa 


11100 


tttgaagatt 


ttttgaacag 


cgaaaatgcc gccaaggcat 


acgacgaaga 


gaegcaggeg 


11160 


gccattgacc gaaaggtttt 


cggggttccc 


accgtgtttt 


gggatgatca 


aatgtggtgg 


11220 


ggaaatgacc 


gccttttcat 


gcttgagagc 


aggttgcaaa 


aggaaacgea 


accataaatt 


11280 


35ctcagtgcat cgatttttct 


gatttttcaa ccacgccttg attttaagga gtctcatgaa 


11340 


gctttattac 


agccccggcg 


catgttcatc gtcgcctcac 


atcatccttc gtgaaggtgg 


11400 


atttgacttt 


cagctagaaa 


aggttgatct 


cggcaccaag 


gtgactgaga 


ctggtgttga 


11460 


ttacaagacc 


gtcaatcccg 


tgggcagtgt ccccgctttg caaatggatg atgggcaggt 


11520 


gctcaccgaa gggccggcca 


tcgttcaata cctcgctgac cgtgtgccag aaaaatgcct 


11580 


40ggcaccggca gctggctcgc 


tggagcgtta 


ccgactgatg gaatggctga 


atttcatctc 


11640 
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caccgaattg 


cataaaagct 


tcggagcgct 


gttcagcccg 


gtgtttccac 


aggatgccaa 


11700 


gccagtcatc 


aaggcccaat 


tggaaagccg 


tcttgcccat 


accgagcaga 


tgctcggtga 


11760 


caaggtttgg 


gccatgggga 


acgacttttc 


cgtggtggac 


gcctacttgt 


ttactgtgct 


11820 


tggctggggg 


gcttacgtga 


atgtggacct 


ttcgccttgg 


cccggcctgc 


aaggctiacct 


1 1 QQrt 


Scaaccgcgtg 


gctgaacgtc 


cagcagtccg 


ggccacattg 


tcggctgaag 


gtttgatctg 


J. Xyq U 


atttttttag 


agaaataaat 


aaagcatggc 


cagcctccat 


gagcggccag 


ccatgccgcg 




gggcagccgt 


gggcagcggc 


ctcgatcaaa 


tttcgatgca 


cacagattac 


gagccccttt 


i o n c (\ 


ttccgactgc 


ggtcggattc 


agaaattgcc 


gatacttgct 


ttggtctgat 


cgaagcgtga 


12 120 


gggatatcgg 


cttcccctat 


ccacatgctt 


gatgaagagg 


agcgtatcac 


catgagtcac 


12 180 


IOgaacttggcc 


gactggaaga 


cctgccgcag 


gactaccgag 


acgaactcaa 


acaacttaac 


12240 


ctggtgccgc 


tatggcccag 


cctacgcgcg 


gtgctgccgc 


ccaatgtccc 


gacccgccag 


1 O *3 ft ft 

12 jOO 


acgcagccga 


cttactggtc 


ctaccagacg 


ctcaagccgc 


tgctgctcaa 


ggccggtgag 


1 O "i C ft 

12360 


ctgaccccca 


ttgagaaggc 


cgagcgccgc 


gtgctggtgc 


tggccaaccc 


cggccacggc 


12420 


ctggagaaga 


tgcaagccag 


cgccgccata 


tacctgggca 


tgcagttgct 


gctgcccggc 


12480 


15gagtgggcgc 


ccagccaccg 


ccacaccccc 


aacgcggtgc 


gcatgatcgt 


ggagggcgag 


12540 


ggcgcctaca 


ccaccgtgga 


tggcgagaag 


tgccccatga 


gccggggtga 


cctcatcctc 


12600 


acgcccaccg 


gcctgtggca 


cgagcacggc 


cacgacggca 


acgagcccgt 


ggtgtggctc 


12660 


gatgtgctgg 


acctgccgct 


ggtgtactac 


atggaggcca 


gctaccacat 


cgacggcgag 


1272 0 


cgccagcagg 


tcgaccccgg 


ccggggcgac 


tgcgcctgga 


cccgtgcggg 


cgtggtgcca 


1 1 "7 Q ft 

12 / tJO 


20acccccgtgt 


tccagcgcag 


cgacaagcgc 


tatcctctct 


tgcgctaccc 


ttgggccgac 


12 840 


acccgtgccg 


ccctgctgtc 


gctggcggcc 


gaccagcctg 


agcaggaatg 


cgtgcaggtc 


T 1 O ft ft 

±2 you 


acctacgtca 


accccgaaac 


gggtgacgac 


gccgagaaca 


tcctgggctt 


ctacgccctg 


12 960 


atgctcaagc 


ccggccagac 


cctgcgcctg 


cccgtgcgtt 


cgcccgccgt 


ggtgttccac 


13020 


cagatcgaag 


gccgcagcga 


ggcgcgcatc 


gccgagtcca 


ccttcgccct 


gagggaagcc 


13080 


25gatacctgct 


gcgcccccgg 


ctacaccgag 


gtgacgctga 


aaaacctctc 


agccgaccag 


13 14 0 


ccgtccttca 


tcttcatggc 


cgacgaatcg 


cccctgcacc 


gcaagctggg 


cgtctttgaa 


13200 


aaccgcggct 


gagccgccgc 


gaagcaccac 


acaacaggaa 


agcaagtgag 


caactacctc 


13260 


tggaacccgc 


ctcccgtcca 


gtctctgccc 


gttcgtggca 


agaccgagcg 


cttccccatc 


13320 


aaccgcatct 


tctgcgtcgg 


ccgcaactac 


cacgcccacg 


cggtggaaat 


gggccgtccg 


13380 


30gtcgacaaaa 


gcgtcgaaca 


agcgttctac 


ttcaccaaat 


cgccacaaac 


cctggtggaa 


13440 


agtggtgcga 


ccgtggccta 


cccgccgcgc 


accagcaact 


accactacga 


gatggagctg 


13500 


gtgctggcga 


tcggcaagcc 


cggcttccgc 


gtcagcgaag 


accaggcgca 


cgagttgatt 


13560 


tacggctacg 


ccgccggcct 


ggacatgacc 


cggcgcgacc 


tgcaactggt 


ggcacgcgac 


13620 


aagggccgcc 


cctgggacac 


cggcaaggac 


atcgaggaag 


gctcggtctg 


ctccgagatc 


13680 


35gtgccgatgc 


aaggcgtggt 


ggtggagcag 


ggcgcgatcg 


ccctggaagt 


caacggccag 


13740 


accaaacagt 


cgtccaacgt 


ggacaagctg 


atctggaacg 


tccgcgagat 


cattgccgac 


13800 


ctctccacct 


actaccactt 


gcaacccggc 


gacctgattt 


acaccggcac 


gcccgaaggc 


13860 


gtgggcgccg 


tggtggcggg 


tgacaagatc 


atcggccgtg 


tggaaggcat 


tgccgagatc 


13920 


agcctgaccg 


tcggcccggc 


cgagtgagcc 


tgcacgcgat 


gaagctgtac 


aacttctggc 


13980 


40gcagcggcac 


gtcacaccgg 


ctgcgcatcg 


cgctcaacct 


caagggcgtg 


ccctacgagt 


14040 
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acctggccgt 


gcacctgggc 


aaggaagagc 


acctaaagga 


cgcct tcaag 


y r~t /~i /t o a /"* 


14100 


cacagcagtt 


ggtgcccgcg 


ctggacacgg 


gcgcgcaagt 


gc tgatccag 




14160 


tcatcgaatg 


gctggaggaa 


cagtacccca 


cgcccgcgct 


gctgccggcc 


y en*, y \~ y cn-y 


14220 


gccgccaacg 


ggtgcgcgcg 


ctggccgcca 


tcgtgggctg 


cgacatccac 


L. L> CL 1— c* C» cl 


14280 


Saccgccgcat 


tctggagtac 


ctgcgcaaga 


cgttcggcgc 


cgacgaggcc 


vJQ_s_,dL.v— dciV-.y 


14340 


cctggtgtgg 


cacctggatc 


agcgccggtt 


tcgatgccta 


cgaagcactg 


ctggcggtcg 


14400 


acccgaagcg 


cggccgctac 


agctttggcg 


acacacccac 


gctggccgac 


tgctacctgg 


14460 


tgccgcaggt 


ggaaagcgcc 


cgccgcttcc 


aggtggacct 


gacgccctac 


cccctgatcc 


14520 


gcgcagtgga 


cgcggcctgc 


ggcgagttgg 


acgcatttcg 


gcgcgccgcg 


ccagctgctc 


14580 


lOaacctgattc 


ggcttgacga 


aaaatacgcc 


gccttcgccc 


taataaacag 


ttcgtagccc 


14640 


tgcccagtga 


caagtatcaa 


tgagcaatag 


tgaaaatgta 


aggggaaaga 


ttatgaaaca 


1 A 700 


tattctgacc 


cgacgagcag 


ccctgagcac 


tttgggctcc cttggaatcg ggggccttcc 


14760 


gggcatgagc 


ttggcgcaat 


ccggccctgg 


cgtggccacg atcgtgattg gactcgccgc 


14820 


aggcggagcg 


accgatatgg 


ctgcccgccg 


gttgtctgag ggaatgcgcg gtgcctatgc 


14880 


15ttccagtgtg 


ctcgtcgaca 


accgcacggg 


ggcgggtggt 


cgaattgcga 


tccagcatgt 


14940 


taaggccgcg 


gcaccaaacg 


gcgcgacact 


tctgctgacg 


cctgcatcca 


tgatgacgct 


15000 


ataccctcac 


acgtacaaag 


atct 








15024 



<210> 12 
<211> 4912 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> A modified DNA sequence 



<400> 12 

30 gagctcgttg cgcaagcgct tccatcagtg ggggcatgaa gtacatctcg ccgatgtcgg 60 

tcattgccaa gttgaaggtg cgcgtgctgg caaatgggtc gaaagagtca cgggtcgtca 120 

gtgccgtctg cagcgtgttg agcgcataga ccacgggctc cgcaagatgc agtgcatacg 180 

gtgtcggctc catgcctttt gaggtgcgca agaacaaatc gtcctttagc gccgcacgca 240 

gccgtttaag tgaattgctg acggcaggct gcgtcagccc cagtttttcg ccggccgtcg 300 

35 atacgctccg gtcgagcagt agctggttga agaccaccag cagattcaag tcgatgtcgc 360 

gcagatccat gatgcctcac cattattcat gctggtgatt ttaactatca gacttgatct 420 

atagcgctat accgatcgac gcgccagtat cgcagccatt cggagacaac tgaaaaaaga 4 80 
gcttgcatgg aactggtagt agaacccctc aatttgcatc tgaacgcgga gaccggcagc 
aCCCtgcttg acgtgctcag gtccaacgag gtccccattt cttatagctg catgtcgggc 

40 cgctgcggca cttgccgttg ccgtgtgatt gccggccatc ttcgcgataa cggctccgag 660 



540 
600 
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acagggcgcc 


cgcaggcagg 


aaagggggcc 


tatgtcctgg 


cctgtcaggc 


ggttctgacc 


720 




gaagactgca 


cgatcgagat 


tcctgaatct 


gacgagatcg 


tggttcaccc 


ggcgcgcatc 


780 




gtcaagggga 


cggtcacagc 


gatagacgaa 


gccacccatg 


acatccggcg 


cctgcgcatc 


840 




aaactggcca 


aaccgcttga 


gttcagccct 


ggccagtacg 


caacggtgca 


gttcacgccc 


900 


5 


gaatgcgtcc 


gcccctattc 


gatggccggg 


ctgcctagcg 


atgcggaaat 


ggagtttcag 


960 




attcgcgcgg 


ttccgggcgg 


gcatgtcagc 


aactacgttt 


tcaatgaact 


gtccgtaggc 


1020 




gcttcggtgc 


ggatcagcgg 


ccccctcgga 


acggcctatc 


tgcggcgcac 


gcacaccggc 


1080 




cccatgcttt 


gtgtgggggg 


tggaacaggt 


ctggcgcccg 


tcctttcgat 


cgttcgaggc 


1140 




gcactggaaa 


gcgggatgag 


caaccccatc 


catctgtact 


tcggtgtgcg 


gagcgagcag 


1200 


10 


gacatctatg 


acgaggaacg 


ccttcacgca 


ttggctgcaa 


ggtttccgaa 


tctcaaggtg 


1260 




aatgtcgttg 


ttgcaacagg 


ccctgccggc 


cctggtcatc 


gatccggcct 


ggtcaccgat 


1320 




ctgatcggcc 


gtgacttgcc 


caatttggcg 


ggatggcgcg 


cctacctgtg 


tggcgctccg 


1380 




gccatggtcg 


aggccctgaa 


cctgctcgtt gctcgcctag gcatagtacc 


cgggcacatc 


1440 




catgccgatg 


cgttctatcc 


cagcggcgtc 


tgagcgaagg 


caccatgcga 


acccaattca 


1500 


15 


acccaaggat 


accaagccat 


gagtgaaccc 


caacgattaa 


aacccgtgtt 


tccccaagat 


1560 




ccgaaatggc 


cgggcgaagg 


tagcagccgc 


gttcccttct 


gggcctacac 


ccgcgaagac 


1620 




ctgtacaagc 


gcgaattgga 


gcgcctgttc 


tatgcaaacc 


actggtgcta 


tgtaggcctg 


1680 




gaagccgaga 


ttccgaatcc 


aggcgacttc 


aagcgaacgg 


tgatcggtga gcgctcggtc 


1740 




atcatggtgc 


gtgatccgga 


tggcggcatc 


aacgtggtgg 


agaacgtctg 


cgcccactgt 


1800 


20 ggcatgcgct 


tttgccgcga 


gcgccacggc 


aacgccaagg 


acttcttctg 


cccctaccac 


1860 




cagtggaact 


acagcctcaa 


gggtgacctg 


cagggcgtgc 


ccttccgccg 


aggcgtcaag 


1920 




caggacggca 


aggtcaacgg 


cggcatgccc 


aaggacttca 


aactcgaaga 


acacggcctg 


1980 




accaagctca 


aggtggccgc 


ccgaggcggt 


gcagtgtttg 


cctctttcga 


ccacgatgtc 


2040 




gagcctttcg 


aggactgtgc 


gtgtacgaca 


gcgagatgat 


ccccaactcc 


ctcatctacc 


2100 


25 


ccatttgagg 


ttgcagaaca 


tgtccgagaa ctggattgat gccatcgcac gggacgctgt 


2160 




gcctgagggc 


gatgtggtcg gagtcatcgt 


ggcaggcaaa 


gacattgcct 


tctatgaggt 


2220 




ggaaggtgag 


gtcttcgcca 


ccgacaactt 


gtgtacccac 


ggggctgcgc 


gcttgagcga 


2280 




cggctttctc 


gaaggccggg 


aaattgaatg 


tcctttgcat 


caaggccgat 


tcgatgtttg 


2340 




cacgggtaaa 


gccttgtgca 


cacccctgac 


acaggacatc 


aaaacctacc 


ccgtaaaaat 


2400 


30 


cgaaaacatg 


cgcgtgatgc 


tcaagctgga 


ctaaaactct 


ttgcaggagg 


aaagccaaat 


2460 




ccggaaatca 


ccccacccaa 


cccaatcact 


acccgttttc 


aaacaagagg 


agataagcaa 


2520 




ttatgagtta 


ccaaaactta 


gtgagtgaag 


cagggctgac 


gcaaaagctc 


ctgattcatg 


2580 




gcgacaaaga 


acttttccag cacgaattga agaccatctt cgcgcggaac tggctttttc 


2640 




tgacccatga 


cagtctgatt 


ccctcccccg 


gcgactatgt 


cacagccaaa 


atgggcgtcg 


2700 


35 


atgaagtcat 


cgtctcccgc 


cagaacgatg 


gctcggtgcg 


agcctttttg 


aatgtttgcc 


2760 




gtcaccgggg 


caagacacta 


gttcacactg 


aagccggaaa 


tgcgaaaggc 


tttgtgtgcg 


2820 




gctaccacgg ctggggctac 


ggttccaacg 


gcgaactgca 


aagcgttccc 


tttgaaaaag 


2880 




agttgtacgg 


agatgcgatc 


aaaaagaaat 


gcctgggctt 


gaaagaagtc 


ccccgcatcg 


2940 




aaagctttca tggctttatc 


tatggctgtt 


ttgatgcaga 


agctcccccg 


ctcatcgatt 


3000 


40 atctgggtga 


tgcagcctgg 


tacctggaac 


ccaccttcaa gcactctggt 


ggcctggaac 


3060 



39 



3120 
3180 



3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 



ttgtaggccc ccccggcaaa gtggtggtta aggccaactg gaagcctttt gcggaaaact 
ttgtaggtga catctaccac gttggttgga cgcacgcagc ggctttgcgc gcagggcagt 

cggtatttag ttctcttgcg ggcaacgcta agcttccacc cgaaggcgcg ggcttgcaaa 3240 
tgaccagcaa gtatggcagt ggaatgggct taacgtggga ctactactcc ggtaacttca 

5 gcgctgatat ggttcccgat ctgatggcat tcggcgccgc aaaacaggaa aaactcgcca 
aggaaatcgg cgatgtccgg gcacggattt accgcagcat tctgaacggc acggttttcc 
cgaacaacag ctttttgacc ggctccgcta ccttcaaggt ctggaacccg atcgatgaaa 
acacgaccga ggtttggacg tatgccttcg tagaaaaaga catgcctgag gacttaaagc 
gtcgcttggc tgacgcggct cagcgcagtg tcggaccagc aggattctgg gaaagcgacg 

10 acaacgaaaa catggagacg ttgtcgcaaa atgccaagaa ataccaatcc agcaacagtg 
atcagattgc cagtttgggt ttcggcaagg acgtctacgg cgacgaatgc tatccgggcg 
tcgttggcaa atcggcaatc ggcgaaacca gctatcgcgg attctaccgt gcctaccagg 

ctcacatcag cagctccaat tgggccgagt tcgaaaatgc ctcccgaaat tggcacaccg 3840 

aactcaccaa gacaactgat cgctaatcca ggagccaacc atgatgatca atacccagga 3900 

15 agacaagctg gtctccgcgc acgacgccga agaatttcac cgtttcttca tcgtacaaga 3 960 

tgatgcacta ctgcaagaag tcaacacgct cctgacccgc gaagcgcacc tgctggacat 4020 

tcaggcctac aaagcctggc ttgaacactg cgttgccccc gagatcaaat accaagtgat 4 080 

ctcgcgagaa cttcgctcca cttccgagcg tcgataccaa ctgaatgatg cggtgaatct 4140 

ctacaacgag aactatcaac agctgaaagt tcgagttgaa caccagatgg atcctcagaa 4200 

20 ctgggccaac aacccgaaga tccgcttcac ccgcttcgtc accaatgtca cggcggccaa 4260 

ggacaagagc gcaccggaaa tgctgcatgt gcggtccaac ctcattctcc atcgcgccag 4320 

acgagaaaac caagttgacg tcttctatgc aacgcgtgaa gacaaatgga aacgcatcga 4 380 

aggtggtggt atcaaattgg tcgaacgctt tgtggactac ccggagcgca ttccccagac 4440 

ccacaacctg ttggtcttcc tgtgaaccct ggggatgcct gcctggatgg cgggcattcg 4500 

25 tgattatttt taacagaaat ttattgccat gaacacacag caagttgttg ccatcactgg 4560 

cgccggctcg ggcattggtt tcgagttggt tcgctctttt aaggcagccg gttatcgcgt 4620 

atctgcactc gttcgcaacg aggagcaaga ggcgggtctt cgcagtgaat tcaaagacga 4680 

cattgagatc gtggcgggcg atgtccgtga tcacgccacc aatgagaagc tggttaaaca 4740 

gacggttgcc aagttcgggc gcctggattg cttcatcgga aatgccggga tatgggatta 4 800 

30 catgctgagc atcgatgatg cttgggagaa attctcgggc agtttcgacg agatatttga 4 860 
catcaacgtc aaaagctatt tcagcggcat cagcgcggcc ttgccggagc tc 
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<220> 

<223> A modified DNA sequence 



40 



<400> 13 

atgcatacgt agcggtggcg aaagaggcgc cgctggaaga atccggtctg tagctctggc 60 

agaagaccca aggcgagatc aaccgcaccg gactccatat cctccttcag attgccagca 120 

ttcgggcgca gcgtgctgat ctggatgtga ggagctcgtt gcgcaagcgc ttccatcagt 180 

Sgggggcatga agtacatctc gccgatgtcg gtcattgcca agttgaaggt gcgcgtgctg 24 0 

gcaaatgggt cgaaagagtc acgggtcgtc agtgccgtct gcagcgtgtt gagcgcatag 300 

atcacgggct ccgcaagatg cagtgcatac ggtgtcggct ccatgccttt tgaggtgcgc 360 

aagaacaaat cgtcgttcag cgctgtacgc agccgtttaa gtgaattact gacggcaggc 420 

tgcgtcagcc ccagtttttc gccggccgtc gatacgctcc ggtcgagcag tagctggttg 480 

lOaagaccacca gcagattcaa gtcgatgtcg cgcagatcca tgacctcacc ctcaccatta 540 

ttcatgctgg tgattttaac tatcagactt gatctatagc gctataccga tcgacgcgct 600 

agaatcgcag ccattcggag acaactgaaa aaagagcttg catggaactg gtagtagaac 660 

ccctcaattt gcatctgaac gcggagaccg gcagcaccct gcttgacgtg ctcaggtcca 720 

acgaggtccc catttcttat agctgcatgt cgggccgctg cggcacttgc cgttgccgtg 780 

IStgattgccgg ccatcttcgc gataacggcc ccgagacagg gcgcccgcag gcaggaaagg 840 

gggcctatgt cctggcctgt caggcggttc tgaccgaaga ctgcacgatc gagattcctg 900 

aatctgacga gatcgtggtt cacccggcgc gcatcgtcaa ggggacggtc acagcgatag 960 

acgaagccac ccatgacatc cggcgcctgc gcatcaaact ggccaaaccg cttgagttca 1020 

gccctggcca gtacgcaacg gtgcagttca cgcccgaatg cgtccgcccc tattcgatgg 1080 

20ccgggctgcc tagcgatgcg gaaatggagt ttcagattcg cgcggttccg ggcgggcatg 1140 

tcagcaacta cgttttcaat gaactgtccg taggcgcttc ggtgcggatc agcggccccc 1200 

tcggaacggc ctatctgcgg cgcacgcaca ccggccccat gctttgtgtg gggggtggaa 1260 

caggtctggc gcccgtcctt tcgatcgttc gaggcgcact ggaaagcggg atgagctacc 1320 

ccatccatct gtacttcggt gtgcggagcg agcaggacat ctatgacgag gaacgccttc 1380 

25acgcattggc tgcaaggttt ccgaatctca aggtgaatgt cgttgttgca acaggccctg 1440 

ccggccctgg tcatcgatcc ggcctggtca ccgatctgat cggccgtgac ttgcccaatt 1500 

tggcgggatg gcgactgcat cctgtgtggc gctccggcca tggtcgaggc cctgaacctg 1560 

ctcgttgctc gcctaggcat agtacccggg cacatccatg ccgatgcgtt ctatcccagc 1620 

ggcgtctgag cgaaggcacc atgcgaaccc aattcaaccc aaggatacca agccatgagt 1680 

30gaaccccaac gattaaaacc cgtgtttccc caagatccga aatggccggg cgaaggtagc 1740 

agccgcgttc ccttctgggc ctacacccgc gaagacctgt acaagcgcga attggagcgc 1800 

ctgttctatg caaaccactg gtgctatgta ggcctggaag ccgagattcc gaatccaggc I860 

gacttcaagc gaacggtgat cggtgagcgc tcggtcatca tggtgcgtga tccggatggc 1920 

ggcatcaacg tggtggagaa cgtctgcgcc caccgtggca tgcgcttttg ccgcgagcgc 1980 

35cacggcaacg ccaaggactt cttctgcccc taccaccagt ggaactacag cctcaagggt 2040 

gacctgcagg gcgtgccctt ccgccgtggc gtcaagcagg acggcaaggt caacggcggc 2100 

atgcccaagg acttcaaact cgaagaacac ggcctgacca agctcaaggt ggccgccccg 2160 

aggcggtgca gtgtttgcct ctttcgacca cgatgtcgag cctttcgagg acttcctggg 2220 

ccaaccatcc tgcactactt cgatcgcgtc ttcaatggcc gcaagctcaa gatcctgggc 2280 

40taccgccgcc agcgcatccc gggcaactgg aagctgatgc aggagaacat caaggacccc 2340 
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taccacccgg 


gcctgctgca 


cacctggttc 


tcgaccttcg 


ggctctggcg 


cgccgacaac 


2400 


aagtcggaac 


tgaagatgga 


cgccaagttc 


cgccacgccg 


caatgatctc 


cacgcgcggt 


24 60 


cagggcggca 


agaacgagga 


ggtcgtgtcc 


ggcgtggaca 


gcttcaagga 


acagatgaag 




gtgaacgacc 


cgcgcctgct 


cgacatcgtg 


cccgagccct 


ggtggggcgg 


tccgactgcg 


"icon 


Sgtgatgacca 


cgatcttccc 


cagcgtgatc 


atccagcagc 


aggtcaacag 


cgtatcgacc 




cgccacatcc 


agcccaacgg 


tcacggctcc 


ttcgatttcg 


tctggaccca 


cttcggcttc 


z / uu 


gaggacgaca 


acgaggagtg 


gacccagcgc 


cgcctgatcc 


aggccaacct 


gttcgggccg 


£ / OU 


gcgggcttcg 


tgtcggccga 


tgacggcgag 


gtgatcgagt 


ggtcgcagga 


aggctttgag 


noon 
zozu 


caaaaaccga 


cgcaccgcac 


cgtgatcgag 


atgggcggtc 


acgaaatcgg 


cgacacggac 


Toon 


lOcacatggtca 


ccgagacgct 


gatccgcggc 


atgtacgact 


actggcgcaa 


ggtgatgggg 




gaataaacat 


ggtagacttc 


aaaacctatt 


tcgaactgct 


gaacctgtac 


agcgactacg 


jUUU 


ccatggtgtg 


cgactccgcc 


aattgggaga 


agtggcctga 


tttcttcatc 


gagaccggca 


jUDU 


cctaccgcct 


gcaaccgcgc 


gaaaacttcg agcaggactt gccgctgtgt ctgctggcgc 




tggagagcaa ggccatgatt cgtgaccgag tgtacggtgt 


caaggaaacc 


atgtaccacg 


J loU 


15atccctacta 


ccagcgccac 


atcgtaggca 


cgccgcgcgt gctgtcagtg gagcgtgatg 


324 0 


cggacggcga gcgcatcacc gccgaagcca gctatgccgt gattcgcacc 


aagtacgacg 


3300 


gcgattccac 


gattttcaac 


gccggctatt 


accgagacgt 


gatcgtgcgc 


acgcccgagg 


3360 


gcctcaagct 


gaagtcgcgc 


ctgtgcgtgt 


acgacagcga 


aatgattccc 


aactccatca 


3420 


tctaccctat 


ctgagaagga 


atccaatgag 


cgagaactgg 


atcgacgccg 


ccgcccgcga 


3480 


20cgaggtgcca 


cgagggcgac 


gtgatcggca 


tcaatatcgt 


cggcaaggag 


attgcctcta 


3540 


cgaggtggcg ggcgagatct 


acgccaccga 


caacacctgc 


actcacggcg 


ccgcccgcat 


3600 


gagcgatggc 


tttctcgaag gccgggaaat 


tgaatgtcct 


ttgcatcaag 


gccgattcga 


3660 


tgtttgcacg ggtaaagcct 


tgtgcacacc 


cctgacacag 


gacatcaaaa 


cctaccccgt 


3 720 


aaaaatcgaa 


aacatgcgcg 


tgatgctcaa 


gctggactaa 


atgctcaagc 


tggactaaaa 


3780 


25ctctttgcag gaggaaagcc 


aaatccggaa 


atcaccccac 


ccaacccaat 


cactacccgt 




tttcaaacaa gatgagacaa gcaattatga gttaccaaaa cttagtgagt gaagcagggc 


3900 


tgacgcaaaa 


gcacctgatt 


tatggcgaca 


aagaactttt 


ccagcacgaa 


ttgaagacca 


J7DU 


tcttcgcgcg gaactggctt 


tttctgaccc 


atgacagtct 


gattccctcc 


cccggcgact 




atgtcaaagc 


caaaatgggc 


gtcgatgaag 


tcatcgtctc 


ccgccagaac 


gatggctcgg 


4080 


30tgcgagcctt 


tttgaatgtt 


tgccgtcacc 


ggggcaagac 


aatagttgac 


gctgaagccg 


4140 


gaaatgcgaa 


aggctttgtg 


tgcggttacc 


acggctgggg 


ctatggctcc 


aacggcgaac 


4200 


tgcaaagcgt 


tccctttgaa 


aaagagttgt 


acggagatgc 


gatcaaaaag 


aaatgcctgg 


4260 


gcttgaaaga 


agtcccccgc 


atcgaaagct 


ttcatggctt 


tatctatggc 


tgttttgatg 


4320 


cagaagctcc 


cccgctcatc 


gattatctgg 


gtgatgtagc 


ctggtacctg gaacccacct 


4380 


35tcaagcactc 


tggtggcctg gaacttgtag gcccccccgc 


caaagtggtg 


gttaagggca 


4440 


actggaaggt 


ttttgcggaa 


aactttgtag 


gtgacatcta 


ccacattggt 


tggacgcacg 


4500 


catctatttt 


gcgcgcaggg 


caggcgatat 


ttgctcctct 


tgcgggcaac 


gctatgcttc 


4560 


cacccgaagg 


cacgggcttg 


caagcgacca 


ccaagtatgg 


cagtggaatt 


ggcgtatcgt 


4620 


tggacgccta ctccggtgtc cagagcgctg atctggttcc cgaaatgatg gcattcggcg 


4680 


40gcgcaaaaca 


ggaaaagctc 


gccaaagaaa 


tcggcgatgt 


ccgggcgcgg 


atttaccgca 


4740 



* 
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gccaagtgaa 


cggcacggtt 


ttcccgaaca 


actgcttttt 


gaccggcgcc 


ggtgtct tea 


A Q A A 




aggtctttaa 


cccgatcgat 


gaaaacacga 


ccgaggcttg 


gacgtatgcc 


ategtagaaa 


4 860 




aagacatgcc 


tgaggactta 


aagcgtcgct 


tggctgacgc 


ggctcagcgc 


tetgteggae 






cagcaggata 


ctgggaaagc 


gacgacaacg 


acaacatggt 


gttgtcgcaa 


aatgecaaga 


4980 




Saataccaatc 


cagcaacagt 


gatctgattg 


ccgatttggg 


tttcggcaag 


gaegtctacg 






gcgacgaatg 


ctatccgggc 


gtcgttagca 


aatcggcatt 


cagcgaaacc 


aaccatcgcg 


31UU 




gattctaccg 


tgcctaccag 


gctcacatca 


gcagctccaa 


ttgggccgag 


t t cgaaaaca 






cctcccgaaa 


ttggcacacc 


gaactcacca 


agacgactga 


tcgctaatcc 


aggagecaat 






catgatgatc 


aatacccagg 


aagacaagct 


ggtctccgcg 


cacgacgccg 


aagaatttca 






lOccgtttcttc 


gtcgggcacg 


acagcgatct 


gcagcaagaa 


gtcaccacac 


tcctgacccg 






cgaagccgac 


ctgctggaca 


ttcaggccta 


caaagcctgg 


cttgaacact 


gcgttgcccc 


C A ft ft 




cgagatcaaa 


taccaagtga 


tctcgcgaga 


acttcgctcc 


acttccgagc 


gtcgatacca 


!)4oU 




actgaatgat 


gcggtgaata 


tctacaacga 


gaactatcaa 


cagctgaaag 


t tcgagttga 






acaccagatg 


gatcctcaga 


actggtacaa 


cagcccgaag 


atccgcttca 


cccgcttcgt 


D30U 




15caccaatgtc 


acggcggcca 


aggacaagag 


cgcaccggaa 


atgctgcatg 


cgcgguccaa 






cctcattctc 


catcgcgcca 


gacgaggaaa 


ccaagttgac 


gccttctatg 


caacgegaga 


CI ft ft 




agacaaatgg 


aaacgcatcg 


aaggtggtgg 


catcaaattg 


gtcgaacgct 


ttgtggacta 


KH (Zft 


ty 


cccggagcgc 


agtccccaaa 


cccacaacct 


gatgatcttc 


ctgtgagccc 


tggggatgee 


bo^U 




tgcctggatg 


gcgggcattc 


gtgattattt 


ttaacagggc 


ggattcaaaa 


gtgaagtgca 


C O O A 

5oo0 


y i 


20acaccctgga 


tttcagtgaa 


tgagagtgga 


gtgctgcggg 


ctattcacaa 


gcagttcgcg 


5940 




gtagaccgcc 


agcggtgatc 


gtactcccag 


ccctttcctt 


ggccggttgt 


tgatttcatc 


6000 




ggcaatcgca 


tcgagctgct 


cctggctgta 


gatgctcaga 


tccgtccctt 


tgggcaggta 


6060 




ctggcgcacc 


aggccattca 


tgttctcgtt 


ggagcctctt 


tgccaagggc 


tgtgcgggtc 


6120 




gcagaagtac 


acggctatgc 


cggtttgctc 


actgagcttc 


ttgtgcatgg 


acatctcccg 


6180 


01 


25gccctggtcg 


tacgtcatgc 


tcaaccgcat 


cggctgcgca 


atgcccagca 


gcttgtccgt 


6240 


L,i 


gaaggcctgc 


aagacgttgg 


cagcactggc 


cggcttgaac 


tcaggcagct 


tgaccagcat 


6300 




caccagccgg 


ctggtgcgtt 


ccaccaaggt 


cccgactgcg 


ctggcattgc 


cttctccctt 


C 1 C A 

6360 




gatgagatca 


ccttcccagt 


gcccgggaaa 


ctggcggtct 


tcgatctcgg 


gegggegtae 


6420 




atggatgctc 


agcatgtcgg 


gaatctggcc 


tctgcggtcc 


ttgcccttgc 


tgcgcggcag 


6480 




30ccgcttgttg 


tgggcgtgac 


gcagggtggc 


gatcagctct 


ttgcgcagct 


ctcccacggg 


6540 




catggcatag 


atgcagttgt 


agatggtttc 


gtgtgacacg 


cggtattcat 


ggcccagggg 


6600 




atacaaacgt 


gccagtgtca 


gggcaatctg 


ctcgggcgac 


cagcgcagcc 


gcagcagatg 


6660 




gatcaccagg 


caggacaaga 


tcgactcggg 


atgcagcttg 


atagctggtc 


tgccacagcg 


6720 




tctgcgctgc 


agggactggc 


actgggcatg 


tgcgctgcca 


tagccagatg 


aagatgeat 


6779 
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# 



43 

<213> Artificial Sequence 
<220> 

<223> A polypeptide encoded by SEQ ID NO: 3 

5 

<400> 14 

Met Asn Tyr Asn Asn Lys He Leu Val Ser Glu Ser Gly Leu Ser Gin 

Lys His Leu He His Gly Asp Glu Glu Leu Phe Gin His Glu Leu Lys 
10 20 25 30 

Thr He Phe Ala Arg Asn Trp Leu Phe Leu Thr His Asp Ser Leu He 

35 40 45 

Pro Ala Pro Gly Asp Tyr Val Thr Ala Lys Met Gly He Asp Glu Val 
50 55 60 

15 He Val Ser Arg Gin Asn Asp Gly Ser He Arg Ala Phe Leu Asn Val 
65 70 75 80 

Cys Arg His Arg Gly Lys Thr Leu Val Ser Val Glu Ala Gly Asn Ala 

85 90 95 

Lys Gly Phe Val Cys Ser Tyr His Gly Trp Gly Phe Gly Ser Asn Gly 
20 100 105 HO 

Glu Leu Gin Ser Val Pro Phe Glu Lys Asp Leu Tyr Gly Glu Ser Leu 

115 120 125 

Asn Lys Lys Cys Leu Gly Leu Lys Glu Val Ala Arg Val Glu Ser Phe 
130 135 140 

25 His Gly Phe He Tyr Gly Cys Phe Asp Gin Glu Ala Pro Pro Leu Met 
145 150 155 160 

Asp Tyr Leu Gly Asp Ala Ala Trp Tyr Leu Glu Pro Met Phe Lys His 

165 170 175 

Ser Gly Gly Leu Glu Leu Val Gly Pro Pro Gly Lys Val Val He Lys 
30 180 185 190 

Ala Asn Trp Lys Ala Pro Ala Glu Asn Phe Val Gly Asp Ala Tyr His 

195 200 205 

Val Gly Trp Thr His Ala Ser Ser Leu Arg Ser Gly Glu Ser He Phe 
210 215 220 

35 Ser Ser Leu Ala Gly Asn Ala Ala Leu Pro Pro Glu Gly Ala Gly Leu 
225 230 235 240 

Gin Met Thr Ser Lys Tyr Gly Ser Gly Met Gly Val Leu Trp Asp Gly 

245 250 255 

Tyr Ser Gly Val His Ser Ala Asp Leu Val Pro Glu Leu Met Ala Phe 
40 260 265 270 



44 



Gly Gly Ala Lys Gin Glu Arg Leu Asn Lys Glu lie Gly Asp Val Arg 

275 \ 280 285 

Ala Arg He Tyr Arg Ser His Leu Asn Cys Thr Val Phe Pro Asn Asn 
290 295 300 

5 Ser Met Leu Thr Cys Ser Gly Val Phe Lys Val Trp Asn Pro He Asp 
305 310 315 320 

Ala Asn Thr Thr Glu Val Trp Thr Tyr Ala He Val Glu Lys Asp Met 

325 330 335 

Pro Glu Asp Leu Lys Arg Arg Leu Ala Asp Ser Val Gin Arg Thr Val 
10 340 345 350 

Gly Pro Ala Gly Phe Trp Glu Ser Asp Asp Asn Asp Asn Met Glu Thr 

355 360 365 

Ala Ser Gin Asn Gly Lys Lys Tyr Gin Ser Arg Asp Ser Asp Leu Leu 
370 375 380 

15 Ser Asn Leu Gly Phe Gly Glu Asp Val Tyr Gly Asp Ala Val Tyr Pro 
385 390 395 400 

Gly Val Val Gly Lys Ser Ala He Gly Glu Thr Ser Tyr Arg Gly Phe 

405 410 415 

Tyr Arg Ala Tyr Gin Ala His Val Ser Ser Ser Asn Trp Ala Glu Phe 
20 420 425 430 

Glu His Ala Ser Ser Thr Trp His Thr Glu Leu Thr Lys Thr Thr Asp 
435 440 445 

Arg 



<210> 15 
<211> 449 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> A polypeptide encoded by SEQ ID NO: 4 



<400> 15 

35 Met Asn Tyr Asn Asn Lys He Leu 
1 5 
Lys His Leu He His Gly Asp Glu 
20 

Thr He Phe Ala Arg Asn Trp Leu 
40 35 40 



Val Ser Glu Ser Gly Leu Ser Gin 

10 15 
Glu Leu Phe Gin His Glu Leu Lys 
25 30 
Phe Leu Thr His Asp Ser Leu He 
45 



Pro Ala Pro Gly 
50 

lie Val Ser Arg 
65 

5 Cys Arg His Arg 

Lys Gly Phe Val 
100 

Glu Leu Gin Ser 
10 115 
Asn Lys Lys Cys 
130 

His Gly Phe He 
145 

15 Asp Tyr Leu Gly 

Ser Gly Gly Leu 
180 

Ala Asn Trp Lys 
20 195 

Val Gly Trp Thr 
210 

Ser Ser Leu Ala 
225 

25 Gin Met Thr Ser 

Tyr Ser Gly Val 
260 

Gly Gly Ala Lys 
30 275 

Ala Arg He Tyr 
290 

Ser Met Leu Thr 
305 

35 Ala Asn Thr Thr 

Pro Glu Asp Leu 
340 

Gly Pro Ala Gly 
40 355 



Asp Tyr Val Thr 
55 

Gin Asn Asp Gly 
70 

Gly Lys Thr Leu 
85 

Cys Ser Tyr His 

Val Pro Phe Glu 
120 

Leu Gly Leu Lys 
135 

Tyr Gly Cys Phe 
150 

Asp Ala Ala Trp 
165 

Glu Leu Val Gly 

Ala Pro Ala Glu 
200 

His Ala Ser Ser 
215 

Gly Asn Ala Ala 
230 

Lys Tyr Gly Ser 
245 

His Ser Ala Asp 

Gin Glu Arg Leu 
280 

Arg Ser His Leu 
295 

Cys Ser Gly Val 
310 

Glu Val Trp Thr 
325 

Lys Arg Arg Leu 

Phe Trp Glu Ser 
360 



45 

Ala Lys Met Gly 
60 

Ser He Arg Ala 
75 

Val Ser Val Glu 
90 

Gly Trp Gly Phe 
105 

Lys Asp Leu Tyr 

Glu Val Ala Arg 
140 

Asp Gin Glu Ala 
155 

Tyr Leu Glu Pro 
170 

Pro Pro Gly Lys 
185 

Asn Phe Val Gly 

Leu Arg Ser Gly 
220 

Leu Pro Pro Glu 
235 

Gly Met Gly Val 
250 

Leu Val Pro Glu 
265 

Asn Lys Glu He 

Asn Cys Thr Val 
300 

Phe Lys Val Trp 

315 

Tyr Ala He Val 
330 

Ala Asp Ser Val 
345 

Asp Asp Asn Asp 



He Asp Glu Val 

Phe Leu Asn Val 
80 

Ala Gly Asn Ala 
95 

Gly Ser Asn Gly 
110 

Gly Glu Ser Leu 
125 

Val Glu Ser Phe 

Pro Pro Leu Met 
160 

Met Phe Lys His 
175 

Val Val He Lys 
190 

Asp Ala Tyr His 
205 

Glu Ser He Phe 

Gly Ala Gly Leu 
240 

Leu Trp Asp Gly 
255 

Leu Met Ala Phe 
270 

Gly Asp Val Arg 
285 

Phe Pro Asn Asn 

Asn Pro He Asp 
320 

Glu Lys Asp Met 
335 

Gin Arg Thr Val 
350 

Asn Met Glu Thr 
365 



Ala Ser Gin Asn Gly Lys Lys Tyr 

370 *• 375 

Ser Asn Leu Gly Phe Gly Glu Asp 
385 390 
5 Gly Val Val Gly Lys Ser Ala lie 
405 

Tyr Arg Ala Tyr Gin Ala His Val 
420 

Glu His Ala Ser Ser Thr Trp His 
10 435 440 

Arg 



46 

Gin Ser Arg Asp Ser Asp Leu Leu 
380 

Val Tyr Gly Asp Ala Val Tyr Pro 
395 400 
Gly Glu Thr Ser Tyr Arg Gly Phe 

410 415 
Ser Ser Ser Asn Trp Ala Glu Phe 
425 430 
Thr Glu Leu Thr Lys Thr Thr Asp 
445 



<210> 16 
15 <211> 449 

<212> PRT 

<213> Artificial Sequence 
<220> 

20 <223> A polypeptide encoded by SEQ ID NO: 5 

<400> 16 

Met Asn Tyr Asn Asn Lys lie Leu Val Ser Glu Ser Gly Leu Ser Gin 
15 10 15 

25 Lys His Leu He His Gly Asp Glu Glu Leu Phe Gin His Glu Leu Lys 
20 - 25 30 

Thr He Phe Ala Arg Asn Trp Leu Phe Leu Thr His Asp Ser Leu He 

35 40 45 

Pro Ala Pro Gly Asp Tyr Val Thr Ala Lys Met Gly He Asp Glu Val 
30 50 55 60 

He Val Ser Arg Gin Asn Asp Gly Ser He Arg Ala Phe Leu Asn Val 
65 70 75 80 

Cys Arg His Arg Gly Lys Thr Leu Val Ser Val Glu Ala Gly Asn Ala 
85 90 95 

35 Lys Gly Phe Val Cys Ser Tyr His Gly Trp Gly Phe Gly Ser Asn Gly 
100 105 110 

Glu Leu Gin Ser Val Pro Phe Glu Lys Asp Leu Tyr Gly Glu Ser Leu 

115 120 125 

Asn Lys Lys Cys Leu Gly Leu Lys Glu Val Ala Arg Val Glu Ser Phe 
40 130 135 140 



47 



His Gly Phe He Tyr Gly Cys Phe Asp Gin Glu Ala Pro Pro Leu Met 
145 150 155 160 

Asp Tyr Leu Gly Asp Ala Ala Trp Tyr Leu Glu Pro Met Phe Lys His 
165 170 175 

5 Ser Gly Gly Leu Glu Leu Val Gly Pro Pro Gly Lys Val Val He Lys 
180 185 190 

Ala Asn Trp Lys Ala Pro Ala Glu Asn Phe Val Gly Asp Ala Tyr His 

195 200 205 

Val Gly Trp Thr His Ala Ser Ser Leu Arg Ser Gly Glu Ser He Phe 
10 210 215 220 

Cys Ser Leu Ala Gly Asn Ala Ala Leu Pro Pro Glu Gly Ala Gly Leu 
225 230 235 240 

Gin Met Thr Ser Lys Tyr Gly Ser Gly Met Gly Val Leu Trp Asp Gly 
245 250 255 

15 Tyr Ser Gly Val His Ser Ala Asp Leu Val Pro Glu Leu Met Ala Phe 
260 265 270 

Gly Gly Ala Lys Gin Glu Arg Leu Asn Lys Glu He Gly Asp Val Arg 

275 280 285 

Ala Arg He Tyr Arg Ser His Leu Asn Cys Thr Val Phe Pro Asn Asn 
20 290 295 300 

Ser Met Leu Thr Cys Ser Gly Val Phe Lys Val Trp Asn Pro He Asp 
305 310 315 320 

Ala Asn Thr Thr Glu Val Trp Thr Tyr Ala He Val Glu Lys Asp Met 
325 330 335 

25 Pro Glu Asp Leu Lys Arg Arg Leu Ala Asp Ser Val Gin Arg Thr Val 
340 345 350 

Gly Pro Ala Gly Phe Trp Glu Ser Asp Asp Asn Asp Asn Met Glu Thr 

355 360 365 

Ala Ser Gin Asn Gly Lys Lys Tyr Gin Ser Arg Asp Ser Asp Leu Leu 
30 370 375 380 

Ser Asn Leu Gly Phe Gly Glu Asp Val Tyr Gly Asp Ala Val Tyr Pro 
385 390 395 400 

Gly Val Val Gly Lys Ser Ala He Gly Glu Thr Ser Tyr Arg Gly Phe 
405 410 415 

35 Tyr Arg Ala Tyr Gin Ala His Val Ser Ser Ser Asn Trp Ala Glu Phe 
420 425 430 

Glu His Ala Ser Ser Thr Trp His Thr Glu Leu Thr Lys Thr Thr Asp 
435 440 445 



Arg 

40 



48 



<210> 17 
<211> 449 
<212> PRT 

<213> Artificial Sequence 

5 

<220> 

<223> A polypeptide encoded by SEQ ID NO: 6 
<400> 17 

10 Met Asn Tyr Asn Asn Lys He Leu Val Ser Glu Phe Gly Leu Ser Gin 
15 10 15 

Lys His Leu He His Gly Asp Glu Glu Leu Phe Gin His Glu Leu Lys 

20 25 30 

Thr He Phe Ala Arg Asn Trp Leu Phe Leu Thr His Asp Ser Leu He 
15 35 40 45 

Pro Ala Pro Gly Asp Tyr Val Thr Ala Lys Met Gly He Asp Glu Val 

50 55 60 

He Val Ser Arg Gin Asn Asp Gly Ser He Arg Ala Phe Leu Asn Val 
65 70 75 80 

20 Cys Arg His Arg Gly Lys Thr Leu Val Ser Val Glu Ala Gly Asn Ala 
85 90 95 

Lys Gly Phe Val Cys Ser Tyr His Gly Trp Gly Phe Gly Ser Asn Gly 

100 105 110 

Glu Leu Gin Ser Val Pro Phe Glu Lys Asp Leu Tyr Gly Glu Ser Leu 
25 115 120 125 

Asn Lys Lys Cys Leu Gly Leu Lys Glu Val Ala Arg Val Glu Ser Phe 

130 135 140 

His Gly Phe He Tyr Gly Cys Phe Asp Gin Glu Ala Pro Pro Leu Met 
145 150 155 160 

30 Asp Tyr Leu Gly Asp Ala Ala Trp Tyr Leu Glu Pro Met Phe Lys His 
165 170 175 

Ser Gly Gly Leu Glu Leu Val Gly Pro Pro Gly Lys Val Val He Lys 

180 185 190 

Ala Asn Trp Lys Ala Pro Ala Glu Asn Phe Val Gly Asp Ala Tyr His 
35 195 200 205 

Val Gly Trp Thr His Ala Ser Ser Leu Arg Ser Gly Glu Ser He Phe 

210 215 220 

Ser Ser Leu Ala Gly Asn Ala Ala Leu Pro Pro Glu Gly Ala Gly Leu 
225 230 235 240 

40 Gin Met Thr Ser Lys Tyr Gly Ser Gly Met Gly Val Leu Trp Asp Gly 



49 

245 250 255 

Tyr Ser Gly Val *Iis Ser Ala Asp Leu Val Pro Glu Leu Met Ala Phe 

260 265 270 

Gly Gly Ala Lys Gin Glu Arg Leu Asn Lys Glu He Gly Asp Val Arg 
5 275 280 285 

Ala Arg He Tyr Arg Ser His Leu Asn Cys Thr Val Phe Pro Asn Asn 

290 295 300 

Ser Met Leu Thr Cys Ser Gly Val Phe Lys Val Trp Asn Pro He Asp 
305 310 315 320 

10 Ala Asn Thr Thr Glu Val Trp Thr Tyr Ala He Val Glu Lys Asp Met 
325 330 335 

Pro Glu Asp Leu Lys Arg Arg Leu Ala Asp Ser Val Gin Arg Thr Val 

340 345 350 

Gly Pro Ala Gly Phe Trp Glu Ser Asp Asp Asn Asp Asn Met Glu Thr 
15 355 360 365 

Ala Ser Gin Asn Gly Lys Lys Tyr Gin Ser Arg Asp Ser Asp Leu Leu 

370 375 380 

Ser Asn Leu Gly Phe Gly Glu Asp Val Tyr Gly Asp Ala Val Tyr Pro 
385 390 395 400 

20 Gly Val Val Gly Lys Ser Ala He Gly Glu Thr Ser Tyr Arg Gly Phe 
405 410 415 

Tyr Arg Ala Tyr Arg Ala His Val Ser Ser Ser Asn Trp Ala Glu Phe 

420 425 430 

Glu His Ala Ser Ser Thr Trp His Thr Glu Leu Thr Lys Thr Thr Asp 
25 435 440 445 

Arg 



<210> 18 
30 <211> 449 

<212> PRT 

<213> Artificial Sequence 
<220> 

35 <223> A polypeptide encoded by SEQ ID NO: 7 

<400> 18 

Met Asn Tyr Lys Asn Lys He Leu Val Ser Glu Ser Gly Leu Thr Gin 
15 10 15 

40 Lys His Leu He His Gly Asp Glu Glu Leu Phe Gin His Glu Leu Arg 



50 



20 25 30 

Thr lie Phe Ala »Arg Asn Trp Leu Phe Leu Thr His Asp Ser Leu He 

35 40 45 

Pro Ser Pro Gly Asp Tyr Val Thr Ala Lys Met Gly He Asp Glu Val 
5 50 55 60 

He Val Ser Arg Gin Ser Asp Gly Ser He Arg Ala Phe Leu Asn Val 
65 70 75 80 

Cys Arg His Arg Gly Lys Thr Leu Val Asn Ala Glu Ala Gly Asn Ala 
85 90 95 

10 Lys Gly Phe Val Cys Ser Tyr His Gly Trp Gly Phe Gly Ser Asn Gly 
100 105 HO 

Glu Leu Gin Ser Val Pro Phe Glu Lys Glu Leu Tyr Gly Glu Ser Leu 

115 120 125 

Asn Lys Lys Cys Leu Gly Leu Lys Glu Val Ala Arg Val Glu Ser Phe 
15 130 135 140 

His Gly Phe He Tyr Gly Cys Phe Asp Gin Glu Ala Pro Pro Leu Met 
145 150 155 160 

Asp Tyr Leu Gly Asp Ala Ala Trp Tyr Leu Glu Pro He Phe Lys His 
165 170 175 

20 Ser Gly Gly Leu Glu Leu Val Gly Pro Pro Gly Lys Val Val lie Lys 
180 185 190 

Ala Asn Trp Lys Ala Pro Ala Glu Asn Phe Val Gly Asp Ala Tyr His 

195 200 205 

Val Gly Trp Thr His Ala Ser Ser Leu Arg Ser Gly Glu Ser lie Phe 
25 210 215 220 

Ala Ser Leu Ala Gly Asn Ala Val Leu Pro Pro Glu Gly Ala Gly Leu 
225 230 235 240 

Gin Met Thr Ser Lys Tyr Gly Ser Gly Met Gly Val Leu Trp Asp Gly 
245 250 255 

30 Tyr Ser Gly Val His Ser Ala Asp Leu Val Pro Glu Leu Met Ala Phe 
260 265 270 

Gly Gly Ser Lys Gin Glu Arg Leu Asn Lys Glu He Gly Asp Val Arg 

275 280 285 

Ala Arg He Tyr Arg Ser His Leu Asn Cys Thr Val Phe Pro Asn Asn 
35 290 295 300 

Ser Met Leu Thr Cys Ser Gly Val Phe Lys Val Trp Asn Pro lie Asp 
305 310 315 320 

Ala Asn Thr Thr Glu Val Trp Thr Tyr Ala lie Val Glu Lys Asp Met 
325 330 335 

40 Pro Glu Asp Leu Lys Arg Arg Leu Ala Asp Ser Val Gin Arg Thr Val 



j 



65 



51 



145 150 155 160 

Asp Tyr Leu Gly Asp Ala Ala Trp Tyr Leu Glu Pro Met Phe Lys His 

165 170 175 

Ser Gly Gly Leu Glu Leu Val Gly Pro Pro Gly Lys Val Val lie Lys 
5 180 185 190 

Ala Asn Trp Lys Ala Pro Ala Glu Asn Phe Val Gly Asp Ala Tyr His 

195 200 205 

Val Gly Trp Thr His Ala Ser Ser Leu Arg Ser Gly Glu Ser lie Phe 
210 215 220 

10 Cys Ser Leu Ala Gly Asn Ala Ala Leu Pro Pro Glu Gly Ala Gly Leu 
225 230 235 240 

Gin Met Thr Ser Lys Tyr Gly Ser Gly Met Gly Val Leu Trp Asp Gly 

245 250 255 

Tyr Ser Gly Val His Ser Ala Asp Leu Val Pro Glu Leu Met Ala Phe 
15 260 265 270 

Gly Gly Ala Lys Gin Glu Arg Leu Asn Lys Glu lie Gly Asp Val Arg 

275 280 285 

Ala Arg lie Tyr Arg Ser His Leu Asn Cys Thr Val Phe Pro Asn Asn 
290 295 300 

20 Ser Met Leu Thr Cys Ser Gly Val Phe Lys Val Trp Asn Pro lie Asp 
305 310 315 320 

Ala Asn Thr Thr Glu Val Trp Thr Tyr Ala lie Val Glu Lys Asp Met 

325 330 335 

Pro Glu Asp Leu Lys Arg Arg Leu Ala Asp Ser Val Gin Arg Thr Val 
25 340 345 350 

Gly Pro Ala Gly Phe Trp Glu Ser Asp Asp Asn Asp Asn Met Glu Thr 

355 360 365 

Ala Ser Gin Asn Gly Lys Lys Tyr Gin Ser Arg Asp Ser Asp Leu Leu 
370 375 380 

30 Ser Asn Leu Gly Phe Gly Glu Asp Val Tyr Gly Asp Ala Val Tyr Pro 
385 390 395 400 

Gly Val Val Gly Lys Ser Ala lie Gly Glu Thr Ser Tyr Arg Gly Phe 

405 410 415 

Tyr Arg Ala Tyr Gin Ala His Val Ser Ser Ser Asn Trp Ala Glu Phe 
35 420 425 430 

Glu His Ala Ser Ser Thr Trp His Thr Glu Leu Thr Lys Thr Thr Asp 
435 440 445 

Arg 



<210> 17 
<211> 449 
<212> PRT 



Lys 
Glu 

5 

Asn 

His 
145 
10 Asp 

Ser 

Ala 

15 

Val 

Ser 
225 
20 Gin 

Tyr 

Gly 

25 

Ala 

Ser 
305 
30 Ala 

Pro 

Gly 

35 

Ala 

Ser 
385 
40 Gly 



Gly Phe Val 
100 

Leu Gin Ser 

115 

Lys Lys Cys 
130 

Gly Phe He 

Tyr Leu Gly 

Gly Gly Leu 
180 

Asn Trp Lys 
195 

Gly Trp Thr 
210 

Ser Leu Ala 

Met Thr Ser 

Ser Gly Val 
260 

Gly Ala Lys 

275 
Arg He Tyr 
290 

Val Leu Thr 

Asn Thr Thr 

Glu Asp Leu 
340 

Pro Ala Gly 

355 
Ser Gin Asn 
370 

Asn Leu Gly 
Val Val Gly 



85 

Cys Ser Tyr 

Val Pro Phe 

Leu Gly Leu 
135 

Tyr Gly Cys 
150 

Asp Ala Ala 
165 

Glu Leu Val 

Ala Pro Ala 

His Ala Ser 
215 

Gly Asn Ala 

230 
Lys Tyr Gly 
245 

His Ser Ala 

Gin Glu Arg 

Arg Ser His 
295 

Cys Ser Gly 
310 

Glu Val Trp 
325 

Lys Arg Arg 

Phe Trp Glu 

Gly Lys Lys 
375 

Phe Gly Lys 

390 
Lys Ser Ala 



52 

90 

His Gly Trp 
105 

Glu Lys Glu 
120 

Lys Glu Val 

Phe Asp Gin 

Trp Tyr Leu 
170 

Gly Pro Pro 

185 
Glu Asn Phe 
200 

Ser Leu Arg 

Val Leu Pro 

Ser Gly Met 
250 

Asp Leu Val 

265 
Leu Asn Lys 
280 

Leu Asn Cys 

Val Phe Lys 

Thr Tyr Ala 
330 

Leu Ala Asp 

345 
Ser Asp Asp 
360 

Tyr Gin Ser 
Asp Val Tyr 
He Gly Glu 



Gly Phe Gly 

Leu Tyr Gly 
125 

Ala Arg Val 
140 

Glu Ala Pro 
155 

Glu Pro He 

Gly Lys Val 

Val Gly Asp 
205 

Thr Gly Glu 

220 
Pro Glu Gly 
235 

Gly Val Leu 

Pro Glu Leu 

Glu He Gly 
285 

Thr Val Phe 

300 
Val Trp Asn 
315 

He Val Glu 

Ala Val Gin 

Asn Asp Asn 
365 

Arg Asp Ser 

380 
Gly Asp Ala 
395 

Thr Ser Tyr 



95 

Ser Asn Gly 
110 

Glu Ser Leu 

Glu Ser Phe 

Ser Leu Met 
160 

Phe Lys His 
175 

Val He Lys 
190 

Ala Tyr His 

Ser He Phe 

Ala Gly Leu 
240 

Trp Asp Gly 
255 

Met Ala Phe 
270 

Asp Val Pro 

Pro Asn Asn 

Pro He Asp 
320 

Lys Asp Met 
335 

Arg Thr Val 
350 

Met Glu Thr 

Asp Leu He 

Val Tyr Pro 
400 

Arg Gly Phe 



405 

Tyr Arg Ala Tyr Gin Ala His Val 
420 

Glu Asp Ala Ser Ser Thr Trp His 
5 435 440 

Arg 



53 

410 415 
Ser Ser Ser Asn Trp Ala Glu Phe 
425 430 
Thr Glu Leu Thr Lys Thr Thr Asp 
445 



<210> 20 
10 <211> 449 

<212> PRT 

<213> Artificial Sequence 
<220> 

15 <223> A polypeptide encoded by SEQ ID NO: 9 

<400> 20 

Met Asn Tyr Lys Asn Lys He Leu Val Ser Glu Ser Gly Leu Thr Gin 
15 10 15 

20 Lys His Leu He His Gly Gly Glu Gly Leu Phe Gin His Glu Leu Arg 
20 25 30 

Ala Val Phe Ala Arg Asn Trp Leu Phe Leu Thr His Asp Ser Leu He 

35 40 45 

Pro Ser Pro Gly Asp Tyr Val Thr Ala Lys Met Gly He Asp Glu Val 
25 50 55 60 

He Val Ser Arg Gin Ser Asp Gly Ser He Arg Ala Phe Leu Asn Val 
65 70 75 80 

Cys Arg His Arg Gly Lys Thr Leu Val Asn Ala Glu Ala Gly Asn Ala 
85 90 95 

30 Lys Gly Phe Val Cys Ser Tyr His Gly Trp Gly Phe Gly Ser Asn Gly 
100 105 110 

Glu Leu Gin Ser Val Pro Phe Glu Lys Glu Leu Tyr Gly Glu Ser Leu 

115 120 125 

Asn Lys Lys Cys Leu Gly Leu Lys Glu Val Ala Arg Val Glu Ser Phe 
35 130 135 140 

His Gly Phe He Tyr Ala Cys He Asp Gin Glu Ala Pro Ser Leu Met 
145 150 155 160 

Asp Tyr Leu Gly Asp Ala Ala Trp Tyr Leu Glu Pro He Phe Lys His 
165 170 175 

40 Ser Gly Gly Leu Glu Leu Val Gly Pro Pro Gly Lys Val Val He Lys 



180 

Ala Asn Trp Lys Ala 
195 

Val Gly Trp Thr His 
5 210 

Ser Ser Leu Ala Gly 
225 

Gin Met Thr Ser Lys 
245 

10 Tyr Ser Gly Val His 
260 

Gly Gly Ala Lys Gin 
275 

Ala Arg lie Tyr Arg 
15 290 

Ser Val Leu Thr Cys 
305 

Ala Asn Thr Thr Glu 
325 

20 Pro Glu Asp Leu Lys 
340 

Gly Pro Ala Gly Phe 
355 

Ala Ser Gin Asn Gly 
25 370 

Ser Asn Leu Gly Phe 
385 

Gly Val Val Gly Lys 
405 

30 Tyr Arg Ala Tyr Gin 
420 

Glu Asp Ala Ser Ser 
435 

Arg 

35 



54 

185 

Pro Ala Glu Asn Phe 
200 

Ala Ser Ser Leu Cys 
215 

Asn Ala Val Leu Pro 
230 

Tyr Gly Ser Gly Met 
250 

Ser Ala Asp Leu Val 
265 

Glu Arg Leu Asn Lys 
280 

Ser His Leu Asn Cys 
295 

Ser Gly Val Phe Lys 
310 

Val Trp Thr Tyr Ala 
330 

Arg Arg Leu Ala Asp 
345 

Trp Glu Ser Asp Asp 
360 

Lys Lys Tyr Gin Ser 
375 

Gly Lys Asp Val Tyr 
390 

Ser Ala He Gly Glu 
410 

Ala His Val Ser Ser 
425 

Thr Trp His Thr Glu 
440 



190 

Val Gly Asp Ala Tyr His 
205 

Thr Gly Glu Ser He Phe 
220 

Pro Glu Gly Ala Gly Leu 
235 240 
Gly Val Leu Trp Asp Gly 
255 

Pro Glu Leu Met Ala Phe 
270 

Glu He Gly Asp Val Arg 
285 

Thr Val Phe Pro Asn Asn 
300 

Val Trp Asn Pro He Asp 
315 320 
He Val Glu Lys Asp Met 
335 

Ala Val Gin Arg Thr Val 
350 

Asn Asp Asn Met Glu Thr 
365 

Arg Asp Ser Asp Leu He 
380 

Gly Asp Ala Val Tyr Pro 
395 400 
Thr Ser Tyr Arg Gly Phe 
415 

Ser Asn Trp Ala Glu Phe 
430 

Leu Thr Lys Thr Thr Asp 
445 



<210> 21 
<211> 449 
<212> PRT 
40 <213> Artificial 



Sequence 



55 



<220> 



<223> 



A polypeptide encoded by SEQ ID 



NO: 10 



<400> 



21 



5 Met Asn Tyr Lys Asn Lys Asn Leu Val Ser Glu Ser Gly Leu Thr Gin 
15 10 15 

Lys His Leu He His Gly Asp Glu Glu Leu Phe Gin Arg Glu Leu Glu 

20 25 30 

Thr He Phe Ala Arg Asn Trp Leu Phe Leu Thr His Asp Ser Leu He 
10 35 40 45 

Pro Ser Pro Gly Asp Tyr Val Thr Ala Lys Met Gly Val Asp Glu Val 

50 55 60 

He Val Ser Arg Gin Asn Asp Gly Ser He Arg Ala Phe Leu Asn Val 
65 70 75 80 

15 Cys Arg His Arg Gly Lys Thr Leu Val His Ala Glu Ala Gly Asn Ala 
85 90 95 

Lys Gly Phe Val Cys Ser Tyr His Gly Trp Gly Phe Gly Ala Asn Gly 

100 105 110 

Glu Leu Gin Ser Val Pro Phe Glu Lys Glu Leu Tyr Gly Glu Ala Leu 
20 115 120 125 

Asp Lys Lys Cys Met Gly Leu Lys Glu Val Ala Arg Val Glu Ser Phe 

130 135 140 

His Gly Phe He Tyr Gly Cys Phe Asp Glu Glu Ala Pro Ser Leu Lys 
145 150 155 160 

25 Asp Tyr Met Gly Asp Ala Gly Trp Tyr Leu Glu Pro Met Phe Lys His 
165 170 175 

Ser Gly Gly Leu Glu Leu He Gly Pro Pro Gly Lys Val He He Lys 

180 185 190 

Ala Asn Trp Lys Ala Pro Ala Glu Asn Phe Thr Gly Asp Ala Tyr His 
30 195 200 205 

Val Gly Trp Thr His Ala Ser Ser Leu Arg Ser Gly Gin Ser Val Phe 

210 215 220 

Ser Ser Leu Ala Gly Asn Ala Ala Leu Pro Pro Glu Gly Ala Gly Leu 
225 230 235 240 

35 Gin Met Thr Ser Lys Tyr Gly Ser Gly Met Gly Val Leu Trp Asp Gly 
245 250 255 

Tyr Ser Gly Val His Ser Ala Asp Leu Val Pro Glu Leu Met Ala Phe 

260 265 270 

Gly Gly Ala Lys Gin Glu Arg Leu Asn Lys Glu He Gly Glu Val Arg 



40 



275 



280 



285 



Ala Arg lie Tyr Arg 
290 

Ser Phe Leu Thr Cys 
305 

5 Ala Asn Thr Thr Glu 
325 

Pro Glu Asp Leu Lys 
340 

Gly Pro Ala Gly Phe 
10 355 

Val Ser Gin Asn Ala 
370 

Ser Asn Leu Gly Phe 
385 

15 Gly lie Val Gly Lys 
4 05 

Tyr Arg Ala Tyr Gly 
420 

Glu Asp Val Ser Lys 
20 435 
Arg 



56 

Ser His Leu Asn Cys 
295 

Ser Gly Val Phe Lys 
310 

Val Trp Thr Tyr Ala 
330 

Arg Arg Leu Val Asp 
345 

Trp Glu Ser Asp Asp 
360 

Lys Lys Tyr Gin Ser 
375 

Gly Gly Asp Val Tyr 
390 

Ser Ala lie Gly Glu 
410 

Ala His He Ser Ser 
425 

Asn Trp His Thr Glu 
440 



Thr Val Phe Pro Asn Asn 
300 

Val Trp His Pro He Asp 
315 320 
Met Val Glu Lys Asp Met 
335 

Ala Val Gin Arg Thr Val 
350 

Asn Asp Asn Met Glu Thr 
365 

Arg Asp Gly Asp Leu Val 
380 

Gly Asp Glu Val Tyr Pro 
395 400 
Thr Ser Tyr Arg Gly Phe 
415 

Ser Ser Trp Ala Glu Phe 
430 

Leu Ala Lys Thr Thr Asp 
445 



<210> 22 
25 <211> 447 

<212> PRT 

<213> Artificial Sequence 
<220> 

30 <223> A polypeptide encoded by SEQ ID NO: 11 

<400> 22 

Met He Tyr Glu Asn Leu Val Ser Glu Ala Gly Leu Thr Gin Lys His 
15 10 15 

35 Leu He His Gly Asp Lys Glu Leu Phe Gin His Glu Leu Lys Thr He 
20 25 30 

Phe Ala Arg Asn Trp Leu Phe Leu Thr His Asp Ser Leu He Pro Ser 

35 40 45 

Pro Gly Asp Tyr Val Thr Ala Lys Met Gly Val Asp Glu Val He Val 
40 50 55 60 



57 

Ser Arg Gin Asn Asp Gly Ser Val Arg Ala Phe Leu Asn Val Cys Arg 
65 70 75 80 

His Arg Gly Lys Thr Leu Val His Ala Glu Ala Gly Asn Ala Lys Gly 
85 90 95 

5 Phe Val Cys Ser Tyr His Gly Trp Gly Phe Gly Ser Asn Gly Glu Leu 
100 105 110 

Gin Ser Val Pro Phe Glu Lys Glu Leu Tyr Gly Asp Thr He Lys Lys 

115 120 125 

Lys Cys Leu Gly Leu Lys Glu Val Pro Arg He Glu Ser Phe His Gly 
10 130 135 140 

Phe He Tyr Gly Cys Phe Asp Ala Glu Ala Pro Thr Leu Val Asp Tyr 
145 150 155 160 

Leu Gly Asp Ala Ala Trp Tyr Leu Glu Pro He Phe Lys His Ser Gly 
165 170 175 

15 Gly Leu Glu Leu Val Gly Pro Pro Gly Lys Val Val He Lys Ala Asn 
180 185 190 

Trp Lys Ala Pro Ala Glu Asn Phe Val Gly Asp Ala Tyr His Val Gly 

195 200 205 

Trp Thr His Ala Ser Ser Leu Arg Ser Gly Gin Ser He Phe Thr Pro 
20 210 215 220 

Leu Ala Gly Asn Ala Met Leu Pro Pro Glu Gly Ala Gly Leu Gin Met 
225 230 235 240 

Thr Ser Lys Tyr Gly Ser Gly Met Gly Val Leu Trp Asp Gly Tyr Ser 
245 250 255 

25 Gly Val His Ser Ala Asp Leu Val Pro Glu Met Met Ala Phe Gly Gly 
260 265 . 270 

Ala Lys Gin Glu Lys Leu Ala Lys Glu He Gly Asp Val Arg Ala Arg 

275 280 285 

He Tyr Arg Ser His Leu Asn Cys Thr Val Phe Pro Asn Asn Ser He 
30 290 295 300 

Leu Thr Cys Ser Gly Val Phe Lys Val Trp Asn Pro He Asp Glu Asn 
305 310 315 320 

Thr Thr Glu Val Trp Thr Tyr Ala He Val Glu Lys Asp Met Pro Glu 
325 330 335 

35 Asp Leu Lys Arg Arg Leu Ala Asp Ala Val Gin Arg Thr Val Gly Pro 
340 345 350 

Ala Gly Phe Trp Glu Ser Asp Asp Asn Asp Asn Met Glu Thr Glu Ser 

355 360 365 

Gin Asn Ala Lys Lys Tyr Gin Ser Ser Asn Ser Asp Leu He Ala Asn 
40 370 375 380 



58 

Leu Gly Phe Gly Lys Asp Val Tyr Gly Asp Glu Cys Tyr Pro Gly Val 
385 390 395 400 

Val Ala Lys Ser Ala He Gly Glu Thr Ser Tyr Arg Gly Phe Tyr Arg 
405 410 415 

5 Ala Tyr Gin Ala His He Ser Ser Ser Asn Trp Ala Glu Phe Glu Asn 
420 425 430 

Thr Ser Arg Asn Trp His Thr Glu Leu Thr Lys Thr Thr Asp Arg 
435 440 445 

10 <210> 23 

<211> 447 
<212> PRT 

<213> Artificial Sequence 

15 <220> 

<223> A polypeptide encoded by SEQ ID NO: 12 

<400> 23 

Met Ser Tyr Gin Asn Leu Val Ser Glu Ala Gly Leu Thr Gin Lys Leu 
20 1 5 10 15 

Leu He His Gly Asp Lys Glu Leu Phe Gin His Glu Leu Lys Thr He 

20 25 30 

Phe Ala Arg Asn Trp Leu Phe Leu Thr His Asp Ser Leu He Pro Ser 
35 40 45 

25 Pro Gly Asp Tyr Val Thr Ala Lys Met Gly Val Asp Glu Val He Val 
50 55 60 

Ser Arg Gin Asn Asp Gly Ser Val Arg Ala Phe Leu Asn Val Cys Arg 
65 70 75 80 

His Arg Gly Lys Thr Leu Val His Thr Glu Ala Gly Asn Ala Lys Gly 
30 85 90 95 

Phe Val Cys Gly Tyr His Gly Trp Gly Tyr Gly Ser Asn Gly Glu Leu 

100 105 110 

Gin Ser Val Pro Phe Glu Lys Glu Leu Tyr Gly Asp Ala He Lys Lys 
115 120 125 

35 Lys Cys Leu Gly Leu Lys Glu Val Pro Arg He Glu Ser Phe His Gly 
130 135 140 

Phe lie Tyr Gly Cys Phe Asp Ala Glu Ala Pro Pro Leu lie Asp Tyr 
145 150 155 160 

Leu Gly Asp Ala Ala Trp Tyr Leu Glu Pro Thr Phe Lys His Ser Gly 
40 165 170 175 



59 



Gly Leu Glu Leu Val Gly Pro Pro Gly Lys Val Val Val Lys Ala Asn 

180 185 190 

Trp Lys Pro Phe Ala Glu Asn Phe Val Gly Asp He Tyr His Val Gly 
195 200 205 

5 Trp Thr His Ala Ala Ala Leu Arg Ala Gly Gin Ser Val Phe Ser Ser 
210 215 220 

Leu Ala Gly Asn Ala Lys Leu Pro Pro Glu Gly Ala Gly Leu Gin Met 
225 230 235 240 

Thr Ser Lys Tyr Gly Ser Gly Met Gly Leu Thr Trp Asp Tyr Tyr Ser 
10 245 250 255 

Gly Asn Phe Ser Ala Asp Met Val Pro Asp Leu Met Ala Phe Gly Ala 

260 265 270 

Ala Lys Gin Glu Lys Leu Ala Lys Glu He Gly Asp Val Arg Ala Arg 
275 280 285 

15 He Tyr Arg Ser He Leu Asn Gly Thr Val Phe Pro Asn Asn Ser Phe 
290 295 300 

Leu Thr Gly Ser Ala Thr Phe Lys Val Trp Asn Pro He Asp Glu Asn 
305 310 315 320 

Thr Thr Glu Val Trp Thr Tyr Ala Phe Val Glu Lys Asp Met Pro Glu 
20 325 330 335 

Asp Leu Lys Arg Arg Leu Ala Asp Ala Ala Gin Arg Ser Val Gly Pro 

340 345 350 

Ala Gly Phe Trp Glu Ser Asp Asp Asn Glu Asn Met Glu Thr Leu Ser 
355 360 365 

25 Gin Asn Ala Lys Lys Tyr Gin Ser Ser Asn Ser Asp Gin He Ala Ser 
370 375 380 

Leu Gly Phe Gly Lys Asp Val Tyr Gly Asp Glu Cys Tyr Pro Gly Val 
385 390 395 400 

Val Gly Lys Ser Ala He Gly Glu Thr Ser Tyr Arg Gly Phe Tyr Arg 
30 405 410 415 

Ala Tyr Gin Ala His He Ser Ser Ser Asn Trp Ala Glu Phe Glu Asn 

420 425 430 

Ala Ser Arg Asn Trp His Thr Glu Leu Thr Lys Thr Thr Asp Arg 
435 440 445 

35 

<210> 24 
<211> 451 
<212> PRT 

<213> Artificial Sequence 

40 



60 



<220> 

<223> A polypeptide encoded by SEQ ID NO: 13 



<400> 24 

5 Met Arg Gin Ala lie Met Ser Tyr Gin Asn Leu Val Ser Glu Ala Gly 
15 10 15 

Leu Thr Gin Lys His Leu He Tyr Gly Asp Lys Glu Leu Phe Gin His 

20 25 30 

Glu Leu Lys Thr He Phe Ala Arg Asn Trp Leu Phe Leu Thr His Asp 
10 35 40 45 

Ser Leu He Pro Ser Pro Gly Asp Tyr Val Lys Ala Lys Met Gly Val 

50 55 60 

Asp Glu Val He Val Ser Arg Gin Asn Asp Gly Ser Val Arg Ala Phe 
65 70 75 80 

15 Leu Asn Val Cys Arg His Arg Gly Lys Thr He Val Asp Ala Glu Ala 
85 90 95 

Gly Asn Ala Lys Gly Phe Val Cys Gly Tyr His Gly Trp Gly Tyr Gly 

100 105 110 

Ser Asn Gly Glu Leu Gin Ser Val Pro Phe Glu Lys Glu Leu Tyr Gly 
20 115 120 125 

Asp Ala He Lys Lys Lys Cys Leu Gly Leu Lys Glu Val Pro Arg He 

130 135 140 

Glu Ser Phe His Gly Phe He Tyr Gly Cys Phe Asp Ala Glu Ala Pro 
145 150 155 160 

25 Pro Leu He Asp Tyr Leu Gly Asp Val Ala Trp Tyr Leu Glu Pro Thr 
165 170 175 

Phe Lys His Ser Gly Gly Leu Glu Leu Val Gly Pro Pro Ala Lys Val 

180 185 190 

Val Val Lys Gly Asn Trp Lys Val Phe Ala Glu Asn Phe Val Gly Asp 
30 195 200 205 

He Tyr His He Gly Trp Thr His Ala Ser He Leu Arg Ala Gly Gin 

210 215 220 

Ala He Phe Ala Pro Leu Ala Gly Asn Ala Met Leu Pro Pro Glu Gly 
225 230 235 240 

35 Thr Gly Leu Gin Ala Thr Thr Lys Tyr Gly Ser Gly He Gly Val Ser 
245 250 255 

Leu Asp Ala Tyr Ser Gly Val Gin Ser Ala Asp Leu Val Pro Glu Met 

260 265 270 

Met Ala Phe Gly Gly Ala Lys Gin Glu Lys Leu Ala Lys Glu He Gly 
40 275 280 285 



Asp Val Arg Ala Arg 
290 

Pro Asn Asn Cys Phe 
305 

5 Pro lie Asp Glu Asn 
325 

Lys Asp Met Pro Glu 
340 

Arg Ser Val Gly Pro 
10 355 

Met Val Leu Ser Gin 
370 

Leu lie Ala Asp Leu 
385 

15 Tyr Pro Gly Val Val 
405 

Gly Phe Tyr Arg Ala 
420 

Glu Phe Glu Asn Thr 
20 435 
Thr Asp Arg 
450 



61 

He Tyr Arg Ser Gin 
295 

Leu Thr Gly Ala Gly 
310 

Thr Thr Glu Ala Trp 
330 

Asp Leu Lys Arg Arg 
345 

Ala Gly Tyr Trp Glu 
360 

Asn Ala Lys Lys Tyr 
375 

Gly Phe Gly Lys Asp 
390 

Ser Lys Ser Ala Phe 
410 

Tyr Gin Ala His He 
425 

Ser Arg Asn Trp His 
440 



Val Asn Gly Thr Val Phe 
300 

Val Phe Lys Val Phe Asn 
315 320 
Thr Tyr Ala He Val Glu 
335 

Leu Ala Asp Ala Ala Gin 
350 

Ser Asp Asp Asn Asp Asn 
365 

Gin Ser Ser Asn Ser Asp 
380 

Val Tyr Gly Asp Glu Cys 
395 400 
Ser Glu Thr Asn His Arg 
415 

Ser Ser Ser Asn Trp Ala 
430 

Thr Glu Leu Thr Lys Thr 
445 



<210> 25 
25 <211> 2265 

<212> DNA 

<213> Pseudomonas sp. 



<400> 25 



30 


gagggtagag 


aaatcgaatg 


ccccttgcat 


caaggtcggt 


ttgacgtttg 


cacaggcaaa 


60 




gccctgtgcg cacccgtgac 


acagaacatc 


aaaacatatc 


cagtcaagat 


tgagaacctg 


120 




cgcgtaatga 


ttgatttgag 


ctaagaattt 


taacaggagg 


caccccgggc 


cctagagcgt 


180 




aatcaccccc 


attccatctt 


ttttaggtga 


aaacatgaat 


tacaataata 


aaatcttggt 


240 




aagtgaatct 


ggtctgagcc 


aaaagcacct 


gattcatggc 


gatgaagaac 


ttttccaaca 


300 


35 


tgaactgaaa 


accatttttg 


cgcggaactg 


gctttttctc 


actcatgata 


gcctgattcc 


360 




tgcccccggc gactatgtta 


ccgcaaaaat 


ggggattgac 


gaggtcatcg 


tctcccggca 


420 




gaacgacggt 


tcgattcgtg 


cttttctgaa 


cgtttgccgg 


catcgtggca 


agacgctggt 


480 




gagcgtggaa gccggcaatg 


ccaaaggttt 


tgtttgcagc 


tatcacggct 


ggggcttcgg 


540 




ctccaacggt 


gaactgcaga 


gcgttccatt 


tgaaaaagat 


ctgtacggcg 


agtcgctcaa 


600 


40 


taaaaaatgt 


ctggggttga 


aagaagtcgc 


tcgcgtggag 


agcttccatg 


gcttcatcta 


660 



62 





cggttgcttc 


gaccaggagg 


cccctcctct 


tatggactat 


ctgggtgacg 


ctgcttggta 


720 




cctggaacct 


atgttcaagc 


attccggcgg 


tttagaactg 


gtcggtcctc 


caggcaaggt 


780 




tgtgatcaag 


gccaactgga 


aggcacccgc 


ggaaaacttt 


gtgggagatg 


cataccacgt 


840 




gggttggacg 


cacgcgtctt 


cgcttcgctc 


gggggagtct 


atcttctcgt 


cgctcgctgg 


900 


5 


caatgcggcg 


ctaccacctg 


aaggcgcagg 


cttgcaaatg 


acctccaaat 


acggcagcgg 


960 




catgggtgtg 


ttgtgggacg 


gatattcagg 


tgtgcatagc 


gcagacttgg 


ttccggaatt 


1020 




gatggcattc 


ggaggcgcaa 


agcaggaaag 


gctgaacaaa 


gaaattggcg 


atgttcgcgc 


1080 




tcggatttat 


cgcagccacc 


tcaactgcac 


cgttttcccg 


aacaacagca 


tgctgacctg 


1140 




ctcgggtgtt 


ttcaaagtat 


ggaacccgat 


cgacgcaaac 


accaccgagg 


tctggaccta 


1200 


10 


cgccattgtc 


gaaaaagaca 


tgcctgagga 


tctcaagcgc 


cgcttggccg 


actctgttca 


1260 




gcgaacgttc 


gggcctgctg 


gcttctggga 


aagcgacgac 


aatgacaata 


tggaaacagc 


1320 




ttcgcaaaac 


ggcaagaaat 


atcaatcaag 


agatagtgat 


ctgctttcaa 


accttggttt 


1380 




cggtgaggac 


gtatacggcg 


acgcggtcta 


tccaggcgtc 


gtcggcaaat 


cggcgatcgg 


1440 




cgagaccagt 


tatcgtggtt 


tctaccgggc 


ttaccaggca 


cacgtcagca 


gctccaactg 


1500 


15 


ggctgagttc 


gagcatgcct 


ctagtacttg 


gcatactgaa 


cttacgaaga 


ctactgatcg 


1560 




ctaacagacg 


agtcgaccat 


gatgatcaat 


attcaagaag 


acaagctggt 


ttccgcccac 


1620 




gacgccgaag 


agattcttcg 


tttcttcaat 


tgccacgact 


ctgctttgca 


acaagaagcc 


1680 




actacgctgc 


tgacccagga 


agcgcatttg 


ttggacattc 


aggcttaccg 


tgcttggtta 


1740 




gagcactgcg 


tggggtcaga 


ggtgcaatat 


caggtcattt 


cacgcgaact 


gcgcgcagct 


1800 


20 


tcagagcgtc 


gttataagct 


caatgaagcc 


atgaacgttt 


acaacgaaaa 


ttttcagcaa 


1860 




ctgaaagttc 


gagttgagca 


tcaactggat 


ccgcaaaact 


ggggcaacag 


cccgaagctg 


1920 




cgctttactc 


gctttatcac 


caacgtccag 


gccgcaatgg 


acgtaaatga 


caaagagcta 


1980 




cttcacatcc 


gctccaacgt 


cattctgcac 


cgggcacgac 


gtggcaatca 


ggtcgatgtc 


2040 




ttctacgccg 


cccgggaaga 


taaatggaaa 


cgtggcgaag 


gtggagtacg 


aaaattggtc 


2100 


25 


cagcgattcg 


tcgattaccc 


agagcgcata 


cttcagacgc 


acaatctgat 


ggtctttctg 


2160 




tgattcagtg 


accattttta 


caaatggtca 


ctgcaaccgc 


ggtcaccatt 


aatcaaaggg 


2220 




aatgtacgtg 


tatgggcaat 


caacaagtcg 


tttcgataac 


cggtg 




2265 



<210> 26 
30 <211> 449 

<212> PRT 

<213> Pseudomonas sp. 



<400> 26 

35 Met Asn Tyr Asn Asn Lys He Leu 
1 5 

Lys His Leu He His Gly Asp Glu 
20 

Thr He Phe Ala Arg Asn Trp Leu 
40 35 40 



Val Ser Glu Ser Gly Leu Ser Gin 

10 15 
Glu Leu Phe Gin His Glu Leu Lys 
25 30 
Phe Leu Thr His Asp Ser Leu He 
45 



63 

Pro Ala Pro Gly Asp Tyr Val Thr Ala Lys Met Gly He Asp Glu Val 

50 55 60 

He Val Ser Arg Gin Asn Asp Gly Ser He Arg Ala Phe Leu Asn Val 
65 70 75 80 

5 Cys Arg His Arg Gly Lys Thr Leu Val Ser Val Glu Ala Gly Asn Ala 
85 90 95 

Lys Gly Phe Val Cys Ser Tyr His Gly Trp Gly Phe Gly Ser Asn Gly 

100 105 HO 

Glu Leu Gin Ser Val Pro Phe Glu Lys Asp Leu Tyr Gly Glu Ser Leu 
10 115 120 125 

Asn Lys Lys Cys Leu Gly Leu Lys Glu Val Ala Arg Val Glu Ser Phe 

130 135 140 

His Gly Phe He Tyr Gly Cys Phe Asp Gin Glu Ala Pro Pro Leu Met 
145 150 155 160 

15 Asp Tyr Leu Gly Asp Ala Ala Trp Tyr Leu Glu Pro Met Phe Lys His 
165 170 175 

Ser Gly Gly Leu Glu Leu Val Gly Pro Pro Gly Lys Val Val He Lys 

180 185 190 

Ala Asn Trp Lys Ala Pro Ala Glu Asn Phe Val Gly Asp Ala Tyr His 
20 195 200 205 

Val Gly Trp Thr His Ala Ser Ser Leu Arg Ser Gly Glu Ser He Phe 

210 215 220 

Ser Ser Leu Ala Gly Asn Ala Ala Leu Pro Pro Glu Gly Ala Gly Leu 
225 230 235 240 

25 Gin Met Thr Ser Lys Tyr Gly Ser Gly Met Gly Val Leu Trp Asp Gly 
245 250 255 

Tyr Ser Gly Val His Ser Ala Asp Leu Val Pro Glu Leu Met Ala Phe 

260 265 270 

Gly Gly Ala Lys Gin Glu Arg Leu Asn Lys Glu He Gly Asp Val Arg 
30 275 280 285 

Ala Arg He Tyr Arg Ser His Leu Asn Cys Thr Val Phe Pro Asn Asn 

290 295 300 

Ser Met Leu Thr Cys Ser Gly Val Phe Lys Val Trp Asn Pro He Asp 
305 310 315 320 

35 Ala Asn Thr Thr Glu Val Trp Thr Tyr Ala He Val Glu Lys Asp Met 
325 330 335 

Pro Glu Asp Leu Lys Arg Arg Leu Ala Asp Ser Val Gin Arg Thr Phe 

340 345 350 

Gly Pro Ala Gly Phe Trp Glu Ser Asp Asp Asn Asp Asn Met Glu Thr 
40 355 360 365 



Ala Ser Gin Asn Gly Lys Lys Tyr 

370 375 
Ser Asn Leu Gly Phe Gly Glu Asp 
385 390 
5 Gly Val Val Gly Lys Ser Ala He 
405 

Tyr Arg Ala Tyr Gin Ala His Val 
420 

Glu His Ala Ser Ser Thr Trp His 
10 435 440 

Arg 



64 

Gin Ser Arg Asp Ser Asp Leu Leu 
380 

Val Tyr Gly Asp Ala Val Tyr Pro 
395 400 
Gly Glu Thr Ser Tyr Arg Gly Phe 

410 415 
Ser Ser Ser Asn Trp Ala Glu Phe 
425 430 
Thr Glu Leu Thr Lys Thr Thr Asp 
445 



<210> 27 
15 <211> 2265 

<212> DNA 

<213> Artificial Sequence 
<220> 

20 <223> A modified DNA sequence 

<400> 27 





gagggtagag 


aaatcgaatg 


ccccttgcat 


caaggtcggt 


ttgacgtttg 


cacaggcaaa 


60 




gccctgtgcg 


cacccgtgac 


acagaacatc 


aaaacatatc 


cagtcaagat 


tgagaacctg 


120 


25 


cgcgtaatga 


ttgatttgag 


ctaagaattt 


taacaggagg 


caccccgggc 


cctagagcgt 


180 




aatcaccccc 


attccatctt 


ttttaggtga 


aaacatgaat 


tacaataata 


aaatcttggt 


240 




aagtgaatct 


ggtctgagcc 


aaaagcacct 


gattcatggc 


gatgaagaac 


ttttccaaca 


300 




tgaactgaaa 


accatttttg 


cgcggaactg 


gctttttctc 


actcatgata 


gcctgattcc 


360 




tgcccccggc 


gactatgtta 


ccgcaaaaat 


ggggattgac 


gaggtcatcg 


tctcccggca 


420 


30 


gaacgacggt 


tcgattcgtg 


cttttctgaa 


cgtttgccgg 


catcgtggca 


agacgctggt 


480 




gagcgtggaa 


gccggcaatg 


ccaaaggttt 


tgtttgcagc 


tatcacggct 


ggggcttcgg 


540 




ctccaacggt 


gaactgcaga 


gcgttccatt 


tgaaaaagat 


ctgtacggcg 


agtcgctcaa 


600 




taaaaaatgt 


ctggggttga 


aagaagtcgc 


tcgcgtggag 


agcttccatg 


gcttcatcta 


660 




cggttgcttc 


gaccaggagg 


cccctcctct 


tatggactat 


ctgggtgacg 


ctgcttggta 


720 


35 


cctggaacct 


atgttcaagc 


attccggcgg 


tttagaactg 


gtcggtcctc 


caggcaaggt 


780 




tgtgatcaag 


gccaactgga 


aggcacccgc 


ggaaaacttt 


gtgggagatg 


cataccacgt 


840 




gggttggacg 


cacgcgtctt 


cgcttcgctc 


gggggagtct 


atcttctcgt 


cgctcgctgg 


900 




caatgcggcg 


ctaccacctg 


aaggcgcagg 


cttgcaaatg 


acctccaaat 


acggcagcgg 


960 




catgggtgtg 


ttgtgggacg 


gatattcagg 


tgtgcatagc 


gcagacttgg 


ttccggaatt 


1020 


40 


gatggcattc 


ggaggcgcaa 


agcaggaaag 


gctgaacaaa 


gaaattggcg 


atgttcgcgc 


1080 




65 





tcggatttat 


cgcagccacc 


tcaactgcac 


cgttttcccg 


aacaacagca 


tgctgacctg 


1140 




ctcgggtgtt 


ttcaaagtat 


ggaacccgat 


cgacgcaaac 


accaccgagg 


tctggaccta 


1200 




cgccattgtc 


gaaaaagaca 


tgcctgagga 


tctcaagcgc 


cgcttggccg 


actctgttca 


1260 




gcgaacgggc 


gggcctgctg 


gcttctggga 


aagcgacgac 


aatgacaata 


tggaaacagc 


1320 


5 


ttcgcaaaac 


ggcaagaaat 


atcaatcaag 


agatagtgat 


ctgctttcaa 


accttggttt 


1380 




cggtgaggac 


gtatacggcg 


acgcggtcta 


tccaggcgtc 


gtcggcaaat 


cggcgatcgg 


1440 




cgagaccagt 


tatcgtggtt 


tctaccgggc 


ttaccaggca 


cacgtcagca 


gctccaactg 


1500 




ggctgagttc 


gagcatgcct 


ctagtacttg 


gcatactgaa 


cttacgaaga 


ctactgatcg 


1560 




ctaacagacg 


agtcgaccat 


gatgatcaat 


attcaagaag 


acaagctggt 


ttccgcccac 


1620 


10 


gacgccgaag 


agattcttcg 


tttcttcaat 


tgccacgact 


ctgctttgca 


acaagaagcc 


1680 




actacgctgc 


tgacccagga 


agcgcatttg 


ttggacattc 


aggcttaccg 


tgcttggtta 


1740 




gagcactgcg 


tggggtcaga 


ggtgcaatat 


caggtcattt 


cacgcgaact 


gcgcgcagct 


1800 




tcagagcgtc 


gttataagct 


caatgaagcc 


atgaacgttt 


acaacgaaaa 


ttttcagcaa 


1860 




ctgaaagttc 


gagttgagca 


tcaactggat 


ccgcaaaact 


ggggcaacag 


cccgaagctg 


1920 


i < 
i j 


*~f l~ ^- f~% A— 

cgctctactc 


gccLLatcac 


caacy cccag 


y ccy Cdci t.yy 


a a a a nst 

ctL^yu.dctctL.yci 


/t a a a a n/i ^ "3 

LaadgdgLt.a 






cttcacatcc 


gctccaacgt 


cattctgcac 


cgggcacgac 


gtggcaatca 


ggtcgatgtc 


2040 




ttctacgccg 


cccgggaaga 


taaatggaaa 


cgtggcgaag 


gtggagtacg 


aaaattggtc 


2100 




cagcgattcg 


tcgattaccc 


agagcgcata 


cttcagacgc 


acaatctgat 


ggtctttctg 


2160 




tgattcagtg 


accattttta 


caaatggtca 


ctgcaaccgc 


ggtcaccatt 


aatcaaaggg 


2220 


20 


aatgtacgtg 


tatgggcaat 


caacaagtcg 








2265 




<210> 28 














<211> 2265 














<212> DNA 












25 


<213> Artificial Sequence 












<220> 














<223> A modified DNA sequence 








30 


<400> 28 














gagggtagag 


aaatcgaatg 


ccccttgcat 


caaggtcggt 


ttgacgtttg 


cacaggcaaa 


60 




gccctgtgcg 


cacccgtgac 


acagaacatc 


aaaacatatc 


cagtcaagat 


tgagaacctg 


120 




cgcgtaatga 


ttgatttgag 


ctaagaattt 


taacaggagg 


caccccgggc 


cctagagcgt 


180 




aatcaccccc 


attccatctt 


ttttaggtga 


aaacatgaat 


tacaataata 


aaatcttggt 


240 


35 


aagtgaatct 


ggtctgagcc 


aaaagcacct 


gattcatggc 


gatgaagaac 


ttttccaaca 


300 




tgaactgaaa 


accatttttg 


cgcggaactg 


gctttttctc 


actcatgata 


gcctgattcc 


360 




tgcccccggc 


gactatgtta 


ccgcaaaaat 


ggggattgac 


gaggtcatcg 


tctcccggca 


420 




gaacgacggt 


tcgattcgtg 


cttttctgaa 


cgtttgccgg 


catcgtggca 


agacgctggt 


480 




gagcgtggaa 


gccggcaatg 


ccaaaggttt 


tgtttgcagc 


tatcacggct 


ggggcttcgg 


540 


40 


ctccaacggt 


gaactgcaga 


gcgttccatt 


tgaaaaagat 


ctgtacggcg agtcgctcaa 


600 



66 



taaaaaatgt ctggggttga aagaagtcgc tcgcgtggag agcttccatg gcttcatcta 660 

cggttgcttc gaccaggagg cccctcctct tatggactat ctgggtgacg ctgcttggta 720 

cctggaacct atgttcaagc attccggcgg tttagaactg gtcggtcctc caggcaaggt 780 

tgtgatcaag gccaactgga aggcacccgc ggaaaacttt gtgggagatg cataccacgt 840 

5 gggttggacg cacgcgtctt cgcttcgctc gggggagtct atcttctcgt cgctcgctgg 900 

caatgcggcg ctaccacctg aaggcgcagg cttgcaaatg acctccaaat acggcagcgg 960 

catgggtgtg ttgtgggacg gatattcagg tgtgcatagc gcagacttgg ttccggaatt 1020 

gatggcattc ggaggcgcaa agcaggaaag gctgaacaaa gaaattggcg atgttcgcgc 1080 

tcggatttat cgcagccacc tcaactgcac cgttttcccg aacaacagca tgctgacctg 1140 

10 ctcgggtgtt ttcaaagtat ggaacccgat cgacgcaaac accaccgagg tctggaccta 1200 

cgccattgtc gaaaaagaca tgcctgagga tctcaagcgc cgcttggccg actctgttca 1260 

gcgaacggcc gggcctgctg gcttctggga aagcgacgac aatgacaata tggaaacagc 1320 

ttcgcaaaac ggcaagaaat atcaatcaag agatagtgat ctgctttcaa accttggttt 13 80 

cggtgaggac gtatacggcg acgcggtcta tccaggcgtc gtcggcaaat cggcgatcgg 1440 

15 cgagaccagt tatcgtggtt tctaccgggc ttaccaggca cacgtcagca gctccaactg 1500 

ggctgagttc gagcatgcct ctagtacttg gcatactgaa cttacgaaga ctactgatcg 1560 

ctaacagacg agtcgaccat gatgatcaat attcaagaag acaagctggt ttccgcccac 1620 

gacgccgaag agattcttcg tttcttcaat tgccacgact ctgctttgca acaagaagcc 1680 

actacgctgc tgacccagga agcgcatttg ttggacattc aggcttaccg tgcttggtta 1740 

20 gagcactgcg tggggtcaga ggtgcaatat caggtcattt cacgcgaact gcgcgcagct 1800 

tcagagcgtc gttataagct caatgaagcc atgaacgttt acaacgaaaa ttttcagcaa 1860 

ctgaaagttc gagttgagca tcaactggat ccgcaaaact ggggcaacag cccgaagctg 1920 

cgctttactc gctttatcac caacgtccag gccgcaatgg acgtaaatga caaagagcta 1980 

cttcacatcc gctccaacgt cattctgcac cgggcacgac gtggcaatca ggtcgatgtc 2040 

25 ttctacgccg cccgggaaga taaatggaaa cgtggcgaag gtggagtacg aaaattggtc 2100 

cagcgattcg tcgattaccc agagcgcata cttcagacgc acaatctgat ggtctttctg 2160 

tgattcagtg accattttta caaatggtca ctgcaaccgc ggtcaccatt aatcaaaggg 2220 

aatgtacgtg tatgggcaat caacaagtcg tttcgataac cggtg 2265 

30 <210> 29 

<211> 2265 
<212> DNA 

<213> Artificial Sequence 

35 <220> 

<223> A modified DNA sequence 



<400> 29 

gagggtagag aaatcgaatg ccccttgcat caaggtcggt 
40 gccctgtgcg cacccgtgac acagaacatc aaaacatatc 



ttgacgtttg cacaggcaaa 
cagtcaagat tgagaacctg 



60 
120 



# 



67 





cgcgtaatga 


ttgatttgag 


ctaagaattt 


taacaggagg 


caccccgggc 


cctagagcgt 


180 




aatcaccccc 


attccatctt 


ttttaggtga 


aaacatgaat 


tacaataata 


aaatcttggt 


240 




aagtgaatct 


ggtctgagcc 


aaaagcacct 


gattcatggc 


gatgaagaac 


ttttccaaca 


300 




tgaactgaaa 


accatttttg 


cgcggaactg 


gctttttctc 


actcatgata 


gcctgattcc 


360 


5 


tgcccccggc 


gactatgtta 


ccgcaaaaat 


ggggattgac 


gaggtcatcg 


tctcccggca 


420 




gaacgacggt 


tcgattcgtg 


cttttctgaa 


cgtttgccgg 


catcgtggca 


agacgctggt 


480 




gagcgtggaa 


gccggcaatg 


ccaaaggttt 


tgtttgcagc 


tatcacggct 


ggggcttcgg 


540 




ctccaacggt 


gaactgcaga 


gcgttccatt 


tgaaaaagat 


ctgtacggcg 


agtcgctcaa 


600 




taaaaaatgt 


ctggggttga 


aagaagtcgc 


tcgcgtggag 


agcttccatg gcttcatcta 


660 


10 


cggttgcttc 


gaccaggagg 


cccctcctct 


tatggactat 


ctgggtgacg 


ctgcttggta 


720 




cctggaacct 


atgttcaagc 


attccggcgg 


tttagaactg 


gtcggtcctc 


caggcaaggt 


780 




tgtgatcaag gccaactgga 


aggcacccgc 


ggaaaacttt 


gtgggagatg cataccacgt 


840 




gggttggacg 


cacgcgtctt 


cgcttcgctc 


gggggagtCt 


atcttctcgt 


cgctcgctgg 


900 




caatgcggcg 


ctaccacctg 


aaggcgcagg 


cttgcaaatg 


acctccaaat 


acggcagcgg 


960 


15 


catgggtgtg 


ttgtgggacg 


gatattcagg 


tgtgcatagc 


gcagacttgg 


ttccggaatt 


1020 




gatggcattc 


ggaggcgcaa 


agcaggaaag 


gctgaacaaa 


gaaattggcg atgttcgcgc 


1080 




tcggatttat 


cgcagccacc 


tcaactgcac 


cgttttcccg 


aacaacagca 


tgctgacctg 


1140 




ctcgggtgtt 


ttcaaagtat 


ggaacccgat 


cgacgcaaac 


accaccgagg 


tctggaccta 


1200 




cgccattgtc 


gaaaaagaca 


tgcctgagga 


tctcaagcgc 


cgcttggccg 


actctgttca 


1260 


20 


gcgaacgacc 


gggcctgctg 


gcttctggga 


aagcgacgac 


aatgacaata 


tggaaacagc 


1320 




ttcgcaaaac 


ggcaagaaat 


atcaatcaag 


agatagtgat 


ctgctttcaa accttggttt 


1380 




cggtgaggac 


gtatacggcg 


acgcggtcta 


tccaggcgtc 


gtcggcaaat 


cggcgatcgg 


1440 




cgagaccagt 


tatcgtggtt 


tctaccgggc 


ttaccaggca 


cacgtcagca gctccaactg 


1500 




ggctgagttc 


gagcatgcct 


ctagtacttg 


gcatactgaa 


cttacgaaga 


ctactgatcg 


1560 


25 


ctaacagacg agtcgaccat 


gatgatcaat 


attcaagaag 


acaagctggt 


ttccgcccac 


1620 




gacgccgaag agattcttcg 


tttcttcaat 


tgccacgact 


ctgctttgca 


acaagaagcc 


1680 




actacgctgc 


tgacccagga 


agcgcatttg 


ttggacattc 


aggcttaccg tgcttggtta 


1740 




gagcactgcg 


tggggtcaga 


ggtgcaatat 


caggtcattt 


cacgcgaact gcgcgcagct 


1800 




tcagagcgtc gttataagct 


caatgaagcc 


atgaacgttt 


acaacgaaaa 


ttttcagcaa 


1860 


30 


ctgaaagttc 


gagttgagca 


tcaactggat 


ccgcaaaact 


ggggcaacag 


cccgaagctg 


1920 




cgctttactc 


gctttatcac 


caacgtccag 


gccgcaatgg 


acgtaaatga 


caaagagcta 


1980 




cttcacatcc 


gctccaacgt 


cattctgcac 


cgggcacgac 


gtggcaatca 


ggtcgatgtc 


2040 




ttctacgccg 


cccgggaaga 


taaatggaaa 


cgtggcgaag 


gtggagtacg 


aaaattggtc 


2100 




cagcgattcg 


tcgattaccc 


agagcgcata 


cttcagacgc 


acaatctgat 


ggtctttctg 


2160 


35 


tgattcagtg 


accattttta 


caaatggtca 


ctgcaaccgc 


ggtcaccatt 


aatcaaaggg 


2220 




aatgtacgtg 


tatgggcaat 


caacaagtcg 


tttcgataac 


cggtg 




2265 



40 



<210> 30 
<211> 2265 
<212> DNA 



# 



68 

<213> Artificial Sequence 
<220> 

<223> A modified DNA sequence 

5 



<400> 30 





gagggtagag 


aaatcgaatg 


ccccttgcat 


caaggtcggt 


ttgacgtttg 


cacaggcaaa 


60 




gccctgtgcg cacccgtgac 


acagaacatc 


aaaacatatc 


cagtcaagat 


tgagaacctg 


120 




cgcgtaatga 


ttgatttgag 


ctaagaattt 


taacaggagg 


caccccgggc 


cctagagcgt 


180 


10 


aatcaccccc 


attccatctt 


ttttaggtga 


aaacatgaat 


tacaataata 


aaatcttggt 


240 




aagtgaatct 


ggtctgagcc 


aaaagcacct 


gattcatggc 


gatgaagaac 


ttttccaaca 


300 




tgaactgaaa 


accatttttg 


cgcggaactg 


gctttttctc 


actcatgata gcctgattcc 


360 




tgcccccggc 


gactatgtta 


ccgcaaaaat 


ggggattgac gaggtcatcg 


tctcccggca 


420 




gaacgacggt 


tcgattcgtg 


cttttctgaa 


cgtttgccgg 


catcgtggca 


agacgctggt 


480 


15 


gagcgtggaa gccggcaatg 


ccaaaggttt 


tgtttgcagc 


tatcacggct 


ggggcttcgg 


540 




ctccaacggt 


gaactgcaga 


gcgttccatt 


tgaaaaagat 


ctgtacggcg 


agtcgctcaa 


600 




taaaaaatgt 


ctggggttga 


aagaagtcgc 


tcgcgtggag 


agcttccatg 


gcttcatcta 


660 




cggttgcttc 


gaccaggagg 


cccctcctct 


tatggactat 


ctgggtgacg 


ctgcttggta 


720 




cctggaacct 


atgttcaagc 


attccggcgg 


tttagaactg 


gtcggtcctc 


caggcaaggt 


780 


20 


tgtgatcaag gccaactgga 


aggcacccgc 


ggaaaacttt 


gtgggagatg 


cataccacgt 


840 




gggttggacg cacgcgtctt 


cgcttcgctc 


gggggagtct 


atcttctcgt 


cgctcgctgg 


900 




caatgcggcg 


ctaccacctg 


aaggcgcagg 


cttgcaaatg 


acctccaaat 


acggcagcgg 


960 




catgggtgtg 


ttgtgggacg 


gatattcagg 


tgtgcatagc 


gcagacttgg 


ttccggaatt 


1020 




gatggcattc 


ggaggcgcaa 


agcaggaaag 


gctgaacaaa 


gaaattggcg 


atgttcgcgc 


1080 


25 


tcggatttat 


cgcagccacc 


tcaactgcac 


cgttttcccg 


aacaacagca 


tgctgacctg 


1140 




ctcgggtgtt 


ttcaaagtat 


ggaacccgat 


cgacgcaaac 


accaccgagg 


tctggaccta 


1200 




cgccattgtc gaaaaagaca 


tgcctgagga 


tctcaagcgc 


cgcttggccg 


actctgttca 


1260 




gcgaacgctc gggcctgctg 


gcttctggga 


aagcgacgac 


aatgacaata 


tggaaacagc 


1320 




ttcgcaaaac 


ggcaagaaat 


atcaatcaag 


agatagtgat 


ctgctttcaa 


accttggttt 


1380 


30 


cggtgaggac 


gtatacggcg 


acgcggtcta 


tccaggcgtc 


gtcggcaaat 


cggcgatcgg 


1440 




cgagaccagt 


tatcgtggtt 


tctaccgggc 


ttaccaggca 


cacgtcagca 


gctccaactg 


1500 




ggctgagttc 


gagcatgcct 


ctagtacttg 


gcatactgaa 


cttacgaaga 


ctactgatcg 


1560 




ctaacagacg 


agtcgaccat 


gatgatcaat 


attcaagaag 


acaagctggt 


ttccgcccac 


1620 




gacgccgaag agattcttcg 


tttcttcaat 


tgccacgact 


ctgctttgca 


acaagaagcc 


1680 


35 


actacgctgc 


tgacccagga 


agcgcatttg 


ttggacattc 


aggcttaccg 


tgcttggtta 


1740 




gagcactgcg 


tggggtcaga 


ggtgcaatat 


caggtcattt 


cacgcgaact 


gcgcgcagct 


1800 




tcagagcgtc gttataagct 


caatgaagcc 


atgaacgttt 


acaacgaaaa 


ttttcagcaa 


1860 




ctgaaagttc 


gagttgagca 


tcaactggat 


ccgcaaaact 


ggggcaacag 


cccgaagctg 


1920 




cgctttactc 


gctttatcac 


caacgtccag 


gccgcaatgg 


acgtaaatga 


caaagagcta 


1980 


40 


cttcacatcc 


gctccaacgt 


cattctgcac 


cgggcacgac gtggcaatca ggtcgatgtc 


2040 



69 





ttctacgccg cccgggaaga 


taaatggaaa 


cgtggcgaag 


gtggagtacg 


aaaattggtc 


2100 




cagcgattcg tcgattaccc 


agagcgcata 


cttcagacgc 


acaatctgat 


ggtctttctg 


2160 




tgattcagtg accattttta 


caaatggtca 


ctgcaaccgc 


ggtcaccatt 


aatcaaaggg 


2220 


5 


aatgtacgtg tatgggcaat 


caacaagtcg 


tttcgataac 


cggtg 




2265 




<210> 31 














<211> 2265 














<212> DNA 












10 


<213> Artificial Sequence 












<220> 














<223> A modified DNA sequence 










<400> 31 












15 


gagggtagag aaatcgaatg 


ccccttgcat 


caaqqtcqqt 


ttgacgtttg 


cacaggcaaa 


60 




gccctgtgcg cacccgtgac 


acagaacatc 


aaaacatatc 


cagtcaagat 


tgagaacctg 


120 




cqcqtaatqa ttqatttqaq 


ctaagaattt 


taacaggagg 


caccccgggc 


cctagagcgt 


180 




aatcaccccc attccatctt 


tt ttaggtga 


aaacatgaat 


tacaataata 


aaatcttggt 


240 




aagtgaatct ggtctgagcc 


aaaagcacct 


gattcatggc 


gatgaagaac 


ttttccaaca 


300 


20 


tgaactgaaa accatttttg 


cocoaaacta 


actttttctc 

U ^> V»< L> k«> I* Vp* V* 


actcatgata 


gcctgattcc 


360 




tgcccccggc gactatgtta 


ccgcaaaaat 


aaaaattaac 


gaggtcatcg 


tctcccggca 


420 








Cyttcgccyg 


catcgtggca 


agacgctggt 


480 




gagcgtggaa gccggcaatg 


ccaaaggttt 


tgtttgcagc 


tatcacggct 


ggggcttcgg 


540 




ctccaacggt gaactgcaga 


gcgttccatt 


tgaaaaagat 


ctgtacggcg 


agtcgctcaa 


600 


25 


taaaaaatgt ctggggttga 


aagaagtcgc 


tcgcgtggag 


agcttccatg 


gcttcatcta 


660 




cggttgcttc gaccaggagg 


cccctcctct 


tatggactat 


ctgggtgacg 


ctgcttggta 


720 




cctggaacct atgttcaagc 


attccggcgg 


tttagaactg 


gtcggtcctc 


caggcaaggt 


780 




tgtgatcaag gccaactgga 


aggcacccgc 


ggaaaacttt 


gtgggagatg 


cataccacgt 


840 




gggttggacg cacgcgtctt 


cgcttcgctc 


gggggagtct 


atcttctcgt 


cgctcgctgg 


900 


30 


caatgcggcg ctaccacctg 


aaggcgcagg 


cttgcaaatg 


acctccaaat 


acggcagcgg 


960 




catgggtgtg ttgtgggacg 


gatattcagg 


tgtgcatagc 


gcagacttgg 


ttccggaatt 


1020 




gatggcattc ggaggcgcaa 


agcaggaaag 


gctgaacaaa 


gaaattggcg 


atgttcgcgc 


1080 




tcggatttat cgcagccacc 


tcaactgcac 


cgttttcccg 


aacaacagca 


tgctgacctg 


1140 




ctcgggtgtt ttcaaagtat 


ggaacccgat 


cgacgcaaac 


accaccgagg 


tctggaccta 


1200 


35 


cgccattgtc gaaaaagaca 


tgcctgagga 


tctcaagcgc 


cgcttggccg 


actctgttca 


1260 




gcgaacgatc gggcctgctg 


gcttctggga 


aagcgacgac 


aatgacaata 


tggaaacagc 


1320 




ttcgcaaaac ggcaagaaat 


atcaatcaag 


agatagtgat 


ctgctttcaa 


accttggttt 


1380 




cggtgaggac gtatacggcg 


acgcggtcta 


tccaggcgtc 


gtcggcaaat 


cggcgatcgg 


1440 




cgagaccagt tatcgtggtt 


tctaccgggc 


ttaccaggca 


cacgtcagca gctccaactg 


1500 


40 


ggctgagttc gagcatgcct 


ctagtacttg 


gcatactgaa 


cttacgaaga 


ctactgatcg 


1560 



70 





ctaacagacg 


agtcgaccat 


gatgatcaat 


attcaagaag 


acaagctggt 


ttccgcccac 


1620 




gacgccgaag 


agattcttcg 


tttcttcaat 


tgccacgact 


ctgctttgca 


acaagaagcc 


1680 




actacgctgc 


tgacccagga 


agcgcatttg 


ttggacattc 


aggcttaccg 


tgcttggtta 


1740 




gagcactgcg 


tggggtcaga 


ggtgcaatat 


caggtcattt 


cacgcgaact 


gcgcgcagct 


1800 


5 


tcagagcgtc 


gttataagct 


caatgaagcc 


atgaacgttt 


acaacgaaaa 


ttttcagcaa 


1860 




ctgaaagttc 


gagttgagca 


tcaactggat 


ccgcaaaact 


ggggcaacag 


cccgaagctg 


1920 




cgctttactc 


gctttatcac 


caacgtccag gccgcaatgg 


acgtaaatga 


caaagagcta 


1980 




cttcacatcc 


gctccaacgt 


cattctgcac 


cgggcacgac 


gtggcaatca ggtcgatgtc 


2040 




ttctacgccg 


cccgggaaga 


taaatggaaa 


cgtggcgaag 


gtggagtacg 


aaaattggtc 


2100 


10 


cagcgattcg 


tcgattaccc 


agagcgcata 


cttcagacgc 


acaatctgat 


ggtctttctg 


2160 




tgattcagtg 


accattttta 


caaatggtca ctgcaaccgc 


ggtcaccatt 


aatcaaaggg 


2220 




aatgtacgtg 


tatgggcaat 


caacaagtcg tttcgataac 


cggtg 




2265 



<210> 32 
15 <211> 449 

<212> PRT 

<213> Artificial Sequence 
<220> 

20 <22 3> A polypeptide encoded by SEQ ID NO: 27 

<400> 32 

Met Asn Tyr Asn Asn Lys lie Leu Val Ser Glu Ser Gly Leu Ser Gin 
15 10 15 

25 Lys His Leu lie His Gly Asp Glu Glu Leu Phe Gin His Glu Leu Lys 
20 25 30 

Thr lie Phe Ala Arg Asn Trp Leu Phe Leu Thr His Asp Ser Leu lie 

35 40 45 

Pro Ala Pro Gly Asp Tyr Val Thr Ala Lys Met Gly He Asp Glu Val 
30 50 55 60 

He Val Ser Arg Gin Asn Asp Gly Ser He Arg Ala Phe Leu Asn Val 
65 70 75 80 

Cys Arg His Arg Gly Lys Thr Leu Val Ser Val Glu Ala Gly Asn Ala 
85 90 95 

35 Lys Gly Phe Val Cys Ser Tyr His Gly Trp Gly Phe Gly Ser Asn Gly 
100 105 110 

Glu Leu Gin Ser Val Pro Phe Glu Lys Asp Leu Tyr Gly Glu Ser Leu 

115 120 125 

Asn Lys Lys Cys Leu Gly Leu Lys Glu Val Ala Arg Val Glu Ser Phe 
40 130 135 140 



71 

His Gly Phe lie Tyr Gly Cys Phe Asp Gin Glu Ala Pro Pro Leu Met 
145 150 155 160 

Asp Tyr Leu Gly Asp Ala Ala Trp Tyr Leu Glu Pro Met Phe Lys His 
165 170 175 

5 Ser Gly Gly Leu Glu Leu Val Gly Pro Pro Gly Lys Val Val lie Lys 
180 185 190 

Ala Asn Trp Lys Ala Pro Ala Glu Asn Phe Val Gly Asp Ala Tyr His 

195 200 205 

Val Gly Trp Thr His Ala Ser Ser Leu Arg Ser Gly Glu Ser He Phe 
10 210 215 220 

Ser Ser Leu Ala Gly Asn Ala Ala Leu Pro Pro Glu Gly Ala Gly Leu 
225 230 235 240 

Gin Met Thr Ser Lys Tyr Gly Ser Gly Met Gly Val Leu Trp Asp Gly 
245 250 255 

15 Tyr Ser Gly Val His Ser Ala Asp Leu Val Pro Glu Leu Met Ala Phe 
260 265 270 

Gly Gly Ala Lys Gin Glu Arg Leu Asn Lys Glu He Gly Asp Val Arg 

275 280 285 

Ala Arg He Tyr Arg Ser His Leu Asn Cys Thr Val Phe Pro Asn Asn 
20 290 295 300 

Ser Met Leu Thr Cys Ser Gly Val Phe Lys Val Trp Asn Pro He Asp 
305 310 315 320 

Ala Asn Thr Thr Glu Val Trp Thr Tyr Ala He Val Glu Lys Asp Met 
325 330 335 

25 Pro Glu Asp Leu Lys Arg Arg Leu Ala Asp Ser Val Gin Arg Thr Gly 
340 345 350 

Gly Pro Ala Gly Phe Trp Glu Ser Asp Asp Asn Asp Asn Met Glu Thr 

355 360 365 

Ala Ser Gin Asn Gly Lys Lys Tyr Gin Ser Arg Asp Ser Asp Leu Leu 
30 370 375 380 

Ser Asn Leu Gly Phe Gly Glu Asp Val Tyr Gly Asp Ala Val Tyr Pro 
385 390 395 400 

Gly Val Val Gly Lys Ser Ala He Gly Glu Thr Ser Tyr Arg Gly Phe 
405 410 415 

35 Tyr Arg Ala Tyr Gin Ala His Val Ser Ser Ser Asn Trp Ala Glu Phe 
420 425 430 

Glu His Ala Ser Ser Thr Trp His Thr Glu Leu Thr Lys Thr Thr Asp 
435 440 445 

Arg 

40 



72 

<210> 33 
<211> 449 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> A polypeptide encoded by SEQ ID NO: 28 



<400> 33 

10 Met Asn Tyr Asn Asn Lys lie Leu Val Ser Glu Ser Gly Leu Ser Gin 
15 10 15 

Lys His Leu lie His Gly Asp Glu Glu Leu Phe Gin His Glu Leu Lys 

20 25 30 

Thr lie Phe Ala Arg Asn Trp Leu Phe Leu Thr His Asp Ser Leu lie 
15 35 40 45 

Pro Ala Pro Gly Asp Tyr Val Thr Ala Lys Met Gly lie Asp Glu Val 

50 55 60 

lie Val Ser Arg Gin Asn Asp Gly Ser lie Arg Ala Phe Leu Asn Val 
65 70 75 80 

20 Cys Arg His Arg Gly Lys Thr Leu Val Ser Val Glu Ala Gly Asn Ala 
85 90 95 

Lys Gly Phe Val Cys Ser Tyr His Gly Trp Gly Phe Gly Ser Asn Gly 

100 105 110 

Glu Leu Gin Ser Val Pro Phe Glu Lys Asp Leu Tyr Gly Glu Ser Leu 
25 115 120 125 

Asn Lys Lys Cys Leu Gly Leu Lys Glu Val Ala Arg Val Glu Ser Phe 

130 135 140 

His Gly Phe lie Tyr Gly Cys Phe Asp Gin Glu Ala Pro Pro Leu Met 
145 150 155 160 

30 Asp Tyr Leu Gly Asp Ala Ala Trp Tyr Leu Glu Pro Met Phe Lys His 
165 170 175 

Ser Gly Gly Leu Glu Leu Val Gly Pro Pro Gly Lys Val Val lie Lys 

180 185 190 

Ala Asn Trp Lys Ala Pro Ala Glu Asn Phe Val Gly Asp Ala Tyr His 
35 195 200 205 

Val Gly Trp Thr His Ala Ser Ser Leu Arg Ser Gly Glu Ser lie Phe 

210 215 220 

Ser Ser Leu Ala Gly Asn Ala Ala Leu Pro Pro Glu Gly Ala Gly Leu 
225 230 235 240 

40 Gin Met Thr Ser Lys Tyr Gly Ser Gly Met Gly Val Leu Trp Asp Gly 




245 

Tyr Ser Gly Val His 
260 

Gly Gly Ala Lys Gin 
5 275 

Ala Arg He Tyr Arg 
290 

Ser Met Leu Thr Cys 
305 

10 Ala Asn Thr Thr Glu 
325 

Pro Glu Asp Leu Lys 
340 

Gly Pro Ala Gly Phe 
15 355 

Ala Ser Gin Asn Gly 
370 

Ser Asn Leu Gly Phe 
385 

20 Gly Val Val Gly Lys 
405 

Tyr Arg Ala Tyr Gin 
420 

Glu His Ala Ser Ser 
25 435 
Arg 



73 

250 

Ser Ala Asp Leu Val 
265 

Glu Arg Leu Asn Lys 
280 

Ser His Leu Asn Cys 
295 

Ser Gly Val Phe Lys 
310 

Val Trp Thr Tyr Ala 
330 

Arg Arg Leu Ala Asp 
345 

Trp Glu Ser Asp Asp 
360 

Lys Lys Tyr Gin Ser 
375 

Gly Glu Asp Val Tyr 
390 

Ser Ala He Gly Glu 
410 

Ala His Val Ser Ser 
425 

Thr Trp His Thr Glu 
440 



255 

Pro Glu Leu Met Ala Phe 
270 

Glu He Gly Asp Val Arg 
285 

Thr Val Phe Pro Asn Asn 
300 

Val Trp Asn Pro He Asp 
315 320 
He Val Glu Lys Asp Met 
335 

Ser Val Gin Arg Thr Ala 
350 

Asn Asp Asn Met Glu Thr 
365 

Arg Asp Ser Asp Leu Leu 
380 

Gly Asp Ala Val Tyr Pro 
395 400 
Thr Ser Tyr Arg Gly Phe 
415 

Ser Asn Trp Ala Glu Phe 
430 

Leu Thr Lys Thr Thr Asp 
445 



<210> 34 
30 <211> 449 

<212> PRT 

<213> Artificial Sequence 
<220> 

35 <223> A polypeptide encoded by SEQ ID NO: 29 

<400> 34 

Met Asn Tyr Asn Asn Lys He Leu Val Ser Glu Ser Gly Leu Ser Gin 
15 10 15 

40 Lys His Leu He His Gly Asp Glu Glu Leu Phe Gin His Glu Leu Lys 



74 

20 25 30 

Thr lie Phe Ala Arg Asn Trp Leu Phe Leu Thr His Asp Ser Leu lie 

35 40 45 

Pro Ala Pro Gly Asp Tyr Val Thr Ala Lys Met Gly lie Asp Glu Val 
5 50 55 60 

lie Val Ser Arg Gin Asn Asp Gly Ser lie Arg Ala Phe Leu Asn Val 
65 70 75 80 

Cys Arg His Arg Gly Lys Thr Leu Val Ser Val Glu Ala Gly Asn Ala 
85 90 95 

10 Lys Gly Phe Val Cys Ser Tyr His Gly Trp Gly Phe Gly Ser Asn Gly 
100 105 110 

Glu Leu Gin Ser Val Pro Phe Glu Lys Asp Leu Tyr Gly Glu Ser Leu 

115 120 125 

Asn Lys Lys Cys Leu Gly Leu Lys Glu Val Ala Arg Val Glu Ser Phe 
15 130 135 140 

His Gly Phe He Tyr Gly Cys Phe Asp Gin Glu Ala Pro Pro Leu Met 
145 150 155 160 

Asp Tyr Leu Gly Asp Ala Ala Trp Tyr Leu Glu Pro Met Phe Lys His 
165 170 175 

20 Ser Gly Gly Leu Glu Leu Val Gly Pro Pro Gly Lys Val Val He Lys 
180 185 190 

Ala Asn Trp Lys Ala Pro Ala Glu Asn Phe Val Gly Asp Ala Tyr His 

195 200 205 

Val Gly Trp Thr His Ala Ser Ser Leu Arg Ser Gly Glu Ser He Phe 
25 210 215 220 

Ser Ser Leu Ala Gly Asn Ala Ala Leu Pro Pro Glu Gly Ala Gly Leu 
225 230 235 240 

Gin Met Thr Ser Lys Tyr Gly Ser Gly Met Gly Val Leu Trp Asp Gly 
245 250 255 

30 Tyr Ser Gly Val His Ser Ala Asp Leu Val Pro Glu Leu Met Ala Phe 
260 265 270 

Gly Gly Ala Lys Gin Glu Arg Leu Asn Lys Glu He Gly Asp Val Arg 

275 280 285 

Ala Arg He Tyr Arg Ser His Leu Asn Cys Thr Val Phe Pro Asn Asn 
35 290 295 300 

Ser Met Leu Thr Cys Ser Gly Val Phe Lys Val Trp Asn Pro He Asp 
305 310 315 320 

Ala Asn Thr Thr Glu Val Trp Thr Tyr Ala He Val Glu Lys Asp Met 
325 330 335 

40 Pro Glu Asp Leu Lys Arg Arg Leu Ala Asp Ser Val Gin Arg Thr Thr 



340 

Gly Pro Ala Gly 
355 

Ala Ser Gin Asn 
5 370 

Ser Asn Leu Gly 
385 

Gly Val Val Gly 

10 Tyr Arg Ala Tyr 
420 

Glu His Ala Ser 
435 

Arg 

15 



Phe Trp Glu Ser 
360 

Gly Lys Lys Tyr 
375 

Phe Gly Glu Asp 
390 

Lys Ser Ala lie 
405 

Gin Ala His Val 

Ser Thr Trp His 
440 



75 

345 

Asp Asp Asn Asp 

Gin Ser Arg Asp 
380 

Val Tyr Gly Asp 
395 

Gly Glu Thr Ser 
410 

Ser Ser Ser Asn 
425 

Thr Glu Leu Thr 



350 

Asn Met Glu Thr 
365 

Ser Asp Leu Leu 

Ala Val Tyr Pro 
400 

Tyr Arg Gly Phe 
415 

Trp Ala Glu Phe 
430 

Lys Thr Thr Asp 
445 



<210> 35 
<211> 449 
<212> PRT 
20 <213> Artificial Sequence 



<220> 

<223> A polypeptide encoded by SEQ ID NO:30 



25 <400> 35 

Met Asn Tyr Asn Asn Lys lie Leu Val Ser Glu Ser Gly Leu Ser Gin 

15 10 15 

Lys His Leu lie His Gly Asp Glu Glu Leu Phe Gin His Glu Leu Lys 
20 25 30 

30 Thr lie Phe Ala Arg Asn Trp Leu Phe Leu Thr His Asp Ser Leu lie 
35 40 45 

Pro Ala Pro Gly Asp Tyr Val Thr Ala Lys Met Gly lie Asp Glu Val 

50 55 60 

He Val Ser Arg Gin Asn Asp Gly Ser He Arg Ala Phe Leu Asn Val 
35 65 70 75 80 

Cys Arg His Arg Gly Lys Thr Leu Val Ser Val Glu Ala Gly Asn Ala 

85 90 95 

Lys Gly Phe Val Cys Ser Tyr His Gly Trp Gly Phe Gly Ser Asn Gly 
100 105 110 

40 Glu Leu Gin Ser Val Pro Phe Glu Lys Asp Leu Tyr Gly Glu Ser Leu 



115 

Asn Lys Lys Cys Leu 
130 

His Gly Phe He Tyr 
5 145 

Asp Tyr Leu Gly Asp 
165 

Ser Gly Gly Leu Glu 
180 

10 Ala Asn Trp Lys Ala 
195 

Val Gly Trp Thr His 
210 

Ser Ser Leu Ala Gly 
15 225 

Gin Met Thr Ser Lys 
245 

Tyr Ser Gly Val His 
260 

20 Gly Gly Ala Lys Gin 
275 

Ala Arg He Tyr Arg 
290 

Ser Met Leu Thr Cys 
25 305 

Ala Asn Thr Thr Glu 
325 

Pro Glu Asp Leu Lys 
340 

30 Gly Pro Ala Gly Phe 
355 

Ala Ser Gin Asn Gly 
370 

Ser Asn Leu Gly Phe 
35 385 

Gly Val Val Gly Lys 
405 

Tyr Arg Ala Tyr Gin 
420 

40 Glu His Ala Ser Ser 



76 

120 

Gly Leu Lys Glu Val 
135 

Gly Cys Phe Asp Gin 
150 

Ala Ala Trp Tyr Leu 
170 

Leu Val Gly Pro Pro 
185 

Pro Ala Glu Asn Phe 
200 

Ala Ser Ser Leu Arg 
215 

Asn Ala Ala Leu Pro 
230 

Tyr Gly Ser Gly Met 
250 

Ser Ala Asp Leu Val 
265 

Glu Arg Leu Asn Lys 
280 

Ser His Leu Asn Cys 
295 

Ser Gly Val Phe Lys 
310 

Val Trp Thr Tyr Ala 
330 

Arg Arg Leu Ala Asp 
345 

Trp Glu Ser Asp Asp 
360 

Lys Lys Tyr Gin Ser 
375 

Gly Glu Asp Val Tyr 
390 

Ser Ala He Gly Glu 
410 

Ala His Val Ser Ser 
425 

Thr Trp His Thr Glu 



125 

Ala Arg Val Glu Ser Phe 
140 

Glu Ala Pro Pro Leu Met 
155 160 
Glu Pro Met Phe Lys His 
175 

Gly Lys Val Val He Lys 
190 

Val Gly Asp Ala Tyr His 
205 

Ser Gly Glu Ser He Phe 
220 

Pro Glu Gly Ala Gly Leu 
235 240 
Gly Val Leu Trp Asp Gly 
255 

Pro Glu Leu Met Ala Phe 
270 

Glu He Gly Asp Val Arg 
285 

Thr Val Phe Pro Asn Asn 
300 

Val Trp Asn Pro He Asp 
315 320 
He Val Glu Lys Asp Met 
335 

Ser Val Gin Arg Thr Leu 
350 

Asn Asp Asn Met Glu Thr 
365 

Arg Asp Ser Asp Leu Leu 
380 

Gly Asp Ala Val Tyr Pro 
395 400 
Thr Ser Tyr Arg Gly Phe 
415 

Ser Asn Trp Ala Glu Phe 
430 

Leu Thr Lys Thr Thr Asp 



77 

435 440 445 



Arg 



5 <210> 36 

<211> 449 
<212> PRT 

<213> Artificial Sequence 

10 <220> 

<223> A polypeptide encoded by SEQ ID NO : 31 

<400> 36 

Met Asn Tyr Asn Asn Lys lie Leu Val Ser Glu Ser Gly Leu Ser Gin 
15 1 5 10 15 

Lys His Leu lie His Gly Asp Glu Glu Leu Phe Gin His Glu Leu Lys 

20 25 30 

Thr lie Phe Ala Arg Asn Trp Leu Phe Leu Thr His Asp Ser Leu lie 
35 40 45 

20 Pro Ala Pro Gly Asp Tyr Val Thr Ala Lys Met Gly He Asp Glu Val 
50 55 60 

He Val Ser Arg Gin Asn Asp Gly Ser He Arg Ala Phe Leu Asn Val 
65 70 75 80 

Cys Arg His Arg Gly Lys Thr Leu Val Ser Val Glu Ala Gly Asn Ala 
25 85 90 95 

Lys Gly Phe Val Cys Ser Tyr His Gly Trp Gly Phe Gly Ser Asn Gly 

100 105 110 

Glu Leu Gin Ser Val Pro Phe Glu Lys Asp Leu Tyr Gly Glu Ser Leu 
115 120 125 

30 Asn Lys Lys Cys Leu Gly Leu Lys Glu Val Ala Arg Val Glu Ser Phe 
130 135 140 

His Gly Phe He Tyr Gly Cys Phe Asp Gin Glu Ala Pro Pro Leu Met 
145 150 155 160 

Asp Tyr Leu Gly Asp Ala Ala Trp Tyr Leu Glu Pro Met Phe Lys His 
35 165 170 175 

Ser Gly Gly Leu Glu Leu Val Gly Pro Pro Gly Lys Val Val lie Lys 

180 185 190 

Ala Asn Trp Lys Ala Pro Ala Glu Asn Phe Val Gly Asp Ala Tyr His 
195 200 205 

40 Val Gly Trp Thr His Ala Ser Ser Leu Arg Ser Gly Glu Ser lie Phe 



78 

210 215 220 

Ser Ser Leu Ala Gly Asn Ala Ala Leu Pro Pro Glu Gly Ala Gly Leu 
225 230 235 240 

Gin Met Thr Ser Lys Tyr Gly Ser Gly Met Gly Val Leu Trp Asp Gly 
5 245 250 255 

Tyr Ser Gly Val His Ser Ala Asp Leu Val Pro Glu Leu Met Ala Phe 

260 265 270 

Gly Gly Ala Lys Gin Glu Arg Leu Asn Lys Glu He Gly Asp Val Arg 
275 280 285 

10 Ala Arg He Tyr Arg Ser His Leu Asn Cys Thr Val Phe Pro Asn Asn 
290 295 300 

Ser Met Leu Thr Cys Ser Gly Val Phe Lys Val Trp Asn Pro He Asp 
305 310 315 320 

Ala Asn Thr Thr Glu Val Trp Thr Tyr Ala He Val Glu Lys Asp Met 
15 325 330 335 

Pro Glu Asp Leu Lys Arg Arg Leu Ala Asp Ser Val Gin Arg Thr He 

340 345 350 

Gly Pro Ala Gly Phe Trp Glu Ser Asp Asp Asn Asp Asn Met Glu Thr 
355 360 365 

20 Ala Ser Gin Asn Gly Lys Lys Tyr Gin Ser Arg Asp Ser Asp Leu Leu 
370 375 380 

Ser Asn Leu Gly Phe Gly Glu Asp Val Tyr Gly Asp Ala Val Tyr Pro 
385 390 395 400 

Gly Val Val Gly Lys Ser Ala He Gly Glu Thr Ser Tyr Arg Gly Phe 
25 405 410 415 

Tyr Arg Ala Tyr Gin Ala His Val Ser Ser Ser Asn Trp Ala Glu Phe 

420 425 430 

Glu His Ala Ser Ser Thr Trp His Thr Glu Leu Thr Lys Thr Thr Asp 
435 440 445 

30 Arg 



<210> 37 
<211> 22 
35 <212> DNA 

<213> Artificial Sequence 



<220> 

<223> An oligonucleotide 

40 



<400> 37 
ttcagcgaac ggtcgggcct gc 



79 



<210> 38 
5 <211> 27 

<212> DNA 

<213> Artificial Sequence 
<220> 

10 <223> An oligonucleotide 

<400> 38 
gccgggcctc ttgcgggata tcgtcca 

15 <210> 39 

<211> 27 
<212> DNA 

<213> Artificial Sequence 

20 <220> 

<223> An oligonucleotide 

<400> 39 
gttgccattg ctgcaggcat cgtggtg 

25 

<210> 40 
<211> 33 
<212> DNA 

<213> Artificial Sequence 

30 

<220> 

<223> An oligonucleotide 
<400> 40 

35 gaggcacccg cggaagcttt tgtgggagat gca 

<210> 41 
<211> 30 
<212> DNA 
40 <213> Artificial Sequence 



<223> An oligonucleotide 



<400> 41 

5 gcacccgcgg aacaatttgt gggagatgca 

<210> 42 
<211> 21 
<212> DNA 
10 <213> Artificial Sequence 

<220> 

<223> An oligonucleotide 

15 <400> 42 

ccgcggaaag ctttgtggga g 

<210> 43 
<211> 24 
20 <212> DNA 

<213> Artificial Sequence 

<220> 

<223> An oligonucleotide 

25 

<400> 43 
ccgcggaaaa gcttgtggga gatg 

<210> 44 
30 <211> 23 

<212> DNA 

<213> Artificial Sequence 
<220> 

35 <223> An oligonucleotide 

<400> 44 
cgcggaaaac gttgtgggag atg 



40 <210> 45 



<212> DNA 

<213> Artificial Sequence 



5 <220> 

<223> An oligonucleotide 

<400> 45 
atattcaggt gcgcatagcg cag 

10 

<210> 46 
<211> 34 
<212> DNA 

<213> Artificial Sequence 

15 

<220> 

<223> An oligonucleotide 
<400> 46 

20 ggacggatat tcagggctcc atagcgcaga cttg 

<210> 47 
<211> 33 
<212> DNA 
25 <213> Artificial Sequence 

<220> 

<223> An oligonucleotide 

30 <400> 47 

gacggatatt caggtaacca tagcgcagac ttg 

<210> 48 
<211> 30 
35 <212> DNA 

<213> Artificial Sequence 



<220> 

<223> An oligonucleotide 



<400> 48 

ggtgttttca aagtcgcgaa cccgatcgac 



<210> 49 
5 <211> 26 

<212> DNA 

<213> Artificial Sequence 
<220> 

10 <223> An oligonucleotide 

<400> 49 
ctgttcagcg aaacttcggg cctgct 

15 <210> 50 

<211> 26 
<212> DNA 

<213> Artificial Sequence 

20 <220> 

<223> An oligonucleotide 

<400> 50 
ctgttcagcg aaggttcggg cctgct 

25 

<210> 51 
<211> 26 
<212> DNA 

<213> Artificial Sequence 

30 

<220> 

<223> An oligonucleotide 

<400> 51 
35 ctgttcagcg aagcttcggg cctgct 

<210> 52 
<211> 22 
<212> DNA 
40 <213> Artificial Sequence 



<223> An oligonucleotide 



<400> 52 
5 ttcagcgaac gctcgggcct gc 

<210> 53 
<211> 30 
<212> DNA 
10 <213> Artificial Sequence 

<220> 

<223> An oligonucleotide 

15 <400> 53 

ggcctgctgg cttcgcggaa agcgacgaca 

<210> 54 
<211> 21 
20 <212> DNA 

<213> Artificial Sequence 

<220> 

<223> An oligonucleotide 

25 

<400> 54 
gaaagcgacg ccaatgacaa t 

<210> 55 
30 <211> 30 

<212> DNA 

<213> Artificial Sequence 
<220> 

35 <223> An oligonucleotide 

<400> 55 

acgacaatga caattgggaa acagcttcgc 



40 <210> 56 




84 

<211> 2265 
<212> DNA 

<213> Artificial Sequence 

5 <220> 

<223> A modified DNA sequence 



<400> 56 





gagggtagag 


aaatcgaatg 


ccccttgcat 


caaggtcggt 


ttgacgtttg 


cacaggcaaa 


60 


10 


gccctgtgcg 


cacccgtgac 


acagaacatc 


aaaacatatc 


cagtcaagat 


tgagaacctg 


120 




cgcgtaatga 


ttgatttgag 


ctaagaattt 


taacaggagg 


caccccgggc 


cctagagcgt 


180 




aatcaccccc 


attccatctt 


ttttaggtga 


aaacatgaat 


tacaataata 


aaatcttggt 


240 




aagtgaatct 


ggtctgagcc 


aaaagcacct 


gattcatggc 


gatgaagaac 


ttttccaaca 


300 




tgaactgaaa 


accatttttg 


cgcggaactg 


gctttttctc 


actcatgata 


gcctgattcc 


360 


15 


tgcccccggc 


gactatgtta 


ccgcaaaaat 


ggggattgac 


gaggtcatcg 


tctcccggca 


420 




gaacgacggt 


tcgattcgtg 


cttttctgaa 


cgtttgccgg 


catcgtggca 


agacgctggt 


480 




gagcgtggaa gccggcaatg 


ccaaaggttt 


tgtttgcagc 


tatcacggct 


ggggcttcgg 


540 




ctccaacggt gaactgcaga 


gcgttccatt 


tgaaaaagat 


ctgtacggcg 


agtcgctcaa 


600 




taaaaaatgt 


ctggggttga 


aagaagtcgc 


tcgcgtggag 


agcttccatg 


gcttcatcta 


660 


20 


cggttgcttc 


gaccaggagg 


cccctcctct 


tatggactat 


ctgggtgacg 


ctgcttggta 


720 




cctggaacct 


atgttcaagc 


attccggcgg 


tttagaactg 


gtcggtcctc 


caggcaaggt 


780 




tgtgatcaag gccaactgga 


aggcacccgc 


ggaaaacttt 


gtgggagatg 


cataccacgt 


840 




gggttggacg 


cacgcgtctt 


cgcttcgctc 


gggggagtct 


atcttctcgt 


cgctcgctgg 


900 




caatgcggcg 


ctaccacctg 


aaggcgcagg 


cttgcaaatg 


acctccaaat 


acggcagcgg 


960 


25 


catgggtgtg 


ttgtgggacg 


gatattcagg 


tgtgcatagc 


gcagacttgg 


ttccggaatt 


1020 




gatggcattc ggaggcgcaa 


agcaggaaag 


gctgaacaaa 


gaaattggcg 


atgttcgcgc 


1080 




tcggatttat 


cgcagccacc 


tcaactgcac 


cgttttcccg 


aacaacagca 


tgctgacctg 


1140 




ctcgggtgtt 


ttcaaagtat 


ggaacccgat 


cgacgcaaac 


accaccgagg 


tctggaccta 


1200 




cgccattgtc 


gaaaaagaca 


tgcctgagga 


tctcaagcgc 


cgcttggccg 


actctgttca 


1260 


30 


gcgaacgtgg gggcctgctg 


gcttctggga 


aagcgacgac 


aatgacaata 


tggaaacagc 


1320 




ttcgcaaaac 


ggcaagaaat 


atcaatcaag 


agatagtgat 


ctgctttcaa 


accttggttt 


1380 




cggtgaggac 


gtatacggcg 


acgcggtcta 


tccaggcgtc 


gtcggcaaat 


cggcgatcgg 


1440 




cgagaccagt 


tatcgtggtt 


tctaccgggc 


ttaccaggca 


cacgtcagca 


gctccaactg 


1500 




ggctgagttc 


gagcatgcct 


ctagtacttg 


gcatactgaa 


cttacgaaga 


ctactgatcg 


1560 


35 


ctaacagacg agtcgaccat 


gatgatcaat 


attcaagaag 


acaagctggt 


ttccgcccac 


1620 




gacgccgaag agattcttcg 


tttcttcaat 


tgccacgact 


ctgctttgca 


acaagaagcc 


1680 




actacgctgc 


tgacccagga 


agcgcatttg 


ttggacattc 


aggcttaccg 


tgcttggtta 


1740 




gagcactgcg 


tggggtcaga 


ggtgcaatat 


caggtcattt 


cacgcgaact 


gcgcgcagct 


1800 




tcagagcgtc 


gttataagct 


caatgaagcc 


atgaacgttt 


acaacgaaaa 


ttttcagcaa 


1860 


40 


ctgaaagttc 


gagttgagca 


tcaactggat 


ccgcaaaact 


ggggcaacag 


cccgaagctg 


1920 



* 



85 







y V_ l_ L- d \_ ^_ d 


Q a. v_ y l. c» y 


nrrfipaa t" an 


a r*cf t* a a p t* a a 


^ C* O CX^j ^3 ^ d. 


1980 




cttcacatcc 


gctccaacgt 


cattctgcac 


cgggcacgac 


gtggcaatca 


ggtcgatgtc 


2040 




ttctacgccg 


cccgggaaga 


taaatggaaa 


cgtggcgaag 


gtggagtacg 


aaaattggtc 


2100 




cagcgattcg 


tcgattaccc 


agagcgcata 


cttcagacgc 


acaatctgat 


ggtctttctg 


2160 


5 


tgattcagtg 


accattttta 


caaatggtca 


ctgcaaccgc 


ggtcaccatt 


aatcaaaggg 


2220 




aatgtacgtg 


tatgggcaat 


caacaagtcg 


tttcgataac 


CQQtcr 
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<210> 57 














<211> 2265 












10 


<212> DNA 














<213> Artificial Sequence 












<220> 












15 


<223> A modified DNA sequence 










<400> 57 














gagggtagag 


aaatcgaatg 


ccccttgcat 


caaggtcggt 


ttgacgtttg 


cacaggcaaa 


60 




gccctgtgcg 


cacccgtgac 


acagaacatc 


aaaacatatc 


cagtcaagat 


tgagaacctg 


120 




cgcgtaatga 


ttgatttgag 


ctaagaattt 


taacaggagg 


caccccgggc 


cctagagcgt 


180 


20 


aatcaccccc 


attccatctt 


ttttaggtga 


aaacatgaat 


tacaataata 


aaatcttggt 


240 




aagtgaatct 


ggtctgagcc 


aaaagcacct 


gattcatggc 


gatgaagaac 


ttttccaaca 


300 




tgaactgaaa 


accatttttg 


cgcggaactg 


gctttttctc 


actcatgata gcctgattcc 


360 




tgcccccggc 


gactatgtta 


ccgcaaaaat 


ggggattgac 


gaggtcatcg 


tctcccggca 


420 




gaacgacggt 


tcgattcgtg 


cttttctgaa 


cgtttgccgg 


catcgtggca 


agacgctggt 


480 


25 


gagcgtggaa 


gccggcaatg 


ccaaaggttt 


tgtttgcagc 


tatcacggct 


ggggcttcgg 


540 




ctccaacggt 


gaactgcaga 


gcgttccatt 


tgaaaaagat 


ctgtacggcg 


agtcgctcaa 


600 




taaaaaatgt 


ctggggttga 


aagaagtcgc 


tcgcgtggag 


agcttccatg gcttcatcta 


660 




cggttgcttc 


gaccaggagg 


cccctcctct 


tatggactat 


ctgggtgacg 


ctgcttggta 


720 




cctggaacct 


atgttcaagc 


attccggcgg 


tttagaactg 


gtcggtcctc 


caggcaaggt 


780 


30 


tgtgatcaag 


gccaactgga 


aggcacccgc 


ggaaaacttt 


gtgggagatg 


cataccacgt 


840 




gggttggacg 


cacgcgtctt 


cgcttcgctc 


gggggagtct 


atcttctcgt 


cgctcgctgg 


900 




caatgcggcg 


ctaccacctg 


aaggcgcagg 


cttgcaaatg 


acctccaaat 


acggcagcgg 


960 




catgggtgtg 


ttgtgggacg 


gatattcagg 


tgtgcatagc 


gcagacttgg 


ttccggaatt 


1020 




gatggcattc 


ggaggcgcaa 


agcaggaaag 


gctgaacaaa 


gaaattggcg 


atgttcgcgc 


1080 


35 


tcggatttat 


cgcagccacc 


tcaactgcac 


cgttttcccg 


aacaacagca 


tgctgacctg 


1140 




ctcgggtgtt 


ttcaaagtat 


ggaacccgat 


cgacgcaaac 


accaccgagg 


tctggaccta 


1200 




cgccattgtc 


gaaaaagaca 


tgcctgagga 


tctcaagcgc 


cgcttggccg 


actctgttca 


1260 




gcgaacggtc 


gggcctgctg 


gcttctggga 


aagcgacgac 


aatgacaata 


tggaaacagc 


1320 




ttcgcaaaac 


ggcaagaaat 


atcaatcaag 


agatagtgat 


ctgctttcaa 


accttggttt 


1380 


40 


cggtgaggac 


gtatacggcg 


acgcggtcta 


tccaggcgtc 


gtcggcaaat 


cggcgatcgg 


1440 



cgagaccagt tatcgtggtt tctaccgggc 
ggctgagttc gagcatgcct ctagtacttg 
ctaacagacg agtcgaccat gatgatcaat 
gacgccgaag agattcttcg tttcttcaat 
5 actacgctgc tgacccagga agcgcatttg 
gagcactgcg tggggtcaga ggtgcaatat 
tcagagcgtc gttataagct caatgaagcc 
ctgaaagttc gagttgagca tcaactggat 
cgctttactc gctttatcac caacgtccag 
10 cttcacatcc gctccaacgt cattctgcac 
ttctacgccg cccgggaaga taaatggaaa 
cagcgattcg tcgattaccc agagcgcata 
tgattcagtg accattttta caaatggtca 
aatgtacgtg tatgggcaat caacaagtcg 

15 

<210> 58 
<211> 449 
<212> PRT 

<213> Artificial Sequence 

20 

<220> 

<223> A polypeptide encodec 



86 



ttaccaggca 


cacgtcagca 


gctccaactg 


1500 


gcatactgaa 


cttacgaaga 


ctactgatcg 


1560 


attcaagaag 


acaagctggt 


ttccgcccac 


1620 


tgccacgact 


ctgctttgca 


acaagaagcc 


1680 


ttggacattc 


aggcttaccg 


tgcttggtta 


1740 


caggtcattt 


cacgcgaact 


gcgcgcagct 


1800 


atgaacgttt 


acaacgaaaa 


ttttcagcaa 


1860 


ccgcaaaact 


ggggcaacag 


cccgaagctg 


1920 


gccgcaatgg 


acgtaaatga 


caaagagcta 


1980 


cgggcacgac 


gtggcaatca 


ggtcgatgtc 


2040 


cgtggcgaag 


gtggagtacg 


aaaattggtc 


2100 


cttcagacgc 


acaatctgat 


ggtctttctg 


2160 


ctgcaaccgc 


ggtcaccatt 


aatcaaaggg 


2220 


tttcgataac 


cggtg 




2265 



by SEQ ID NO: 56 



<400> 58 

25 Met Asn Tyr Asn Asn Lys lie Leu Val Ser Glu Ser Gly Leu Ser Gin 
15 10 15 

Lys His Leu lie His Gly Asp Glu Glu Leu Phe Gin His Glu Leu Lys 

20 25 30 

Thr lie Phe Ala Arg Asn Trp Leu Phe Leu Thr His Asp Ser Leu lie 
30 35 40 45 

Pro Ala Pro Gly Asp Tyr Val Thr Ala Lys Met Gly He Asp Glu Val 

50 55 60 

He Val Ser Arg Gin Asn Asp Gly Ser He Arg Ala Phe Leu Asn Val 
65 70 75 80 

35 Cys Arg His Arg Gly Lys Thr Leu Val Ser Val Glu Ala Gly Asn Ala 
85 90 95 

Lys Gly Phe Val Cys Ser Tyr His Gly Trp Gly Phe Gly Ser Asn Gly 

100 105 110 

Glu Leu Gin Ser Val Pro Phe Glu Lys Asp Leu Tyr Gly Glu Ser Leu 
40 115 120 125 



Asn Lys Lys Cys 
130 

His Gly Phe lie 
145 

5 Asp Tyr Leu Gly 

Ser Gly Gly Leu 
180 

Ala Asn Trp Lys 
10 195 
Val Gly Trp Thr 
210 

Ser Ser Leu Ala 
225 

15 Gin Met Thr Ser 

Tyr Ser Gly Val 
260 

Gly Gly Ala Lys 
20 275 

Ala Arg He Tyr 
290 

Ser Met Leu Thr 
305 

25 Ala Asn Thr Thr 

Pro Glu Asp Leu 
340 

Gly Pro Ala Gly 
30 355 

Ala Ser Gin Asn 
370 

Ser Asn Leu Gly 
385 

35 Gly Val Val Gly 

Tyr Arg Ala Tyr 
420 

Glu His Ala Ser 
40 435 



Leu Gly Leu Lys 
135 

Tyr Gly Cys Phe 
150 

Asp Ala Ala Trp 
165 

Glu Leu Val Gly 

Ala Pro Ala Glu 
200 

His Ala Ser Ser 
215 

Gly Asn Ala Ala 
230 

Lys Tyr Gly Ser 
245 

His Ser Ala Asp 

Gin Glu Arg Leu 
280 

Arg Ser His Leu 
295 

Cys Ser Gly Val 
310 

Glu Val Trp Thr 
325 

Lys Arg Arg Leu 

Phe Trp Glu Ser 
360 

Gly Lys Lys Tyr 
375 

Phe Gly Glu Asp 
390 

Lys Ser Ala He 
405 

Gin Ala His Val 

Ser Thr Trp His 
440 



87 

Glu Val Ala Arg 
140 

Asp Gin Glu Ala 
155 

Tyr Leu Glu Pro 
170 

Pro Pro Gly Lys 
185 

Asn Phe Val Gly 

Leu Arg Ser Gly 
220 

Leu Pro Pro Glu 
235 

Gly Met Gly Val 
250 

Leu Val Pro Glu 
265 

Asn Lys Glu He 

Asn Cys Thr Val 
300 

Phe Lys Val Trp 
315 

Tyr Ala He Val 
330 

Ala Asp Ser Val 
345 

Asp Asp Asn Asp 

Gin Ser Arg Asp 
380 

Val Tyr Gly Asp 
395 

Gly Glu Thr Ser 
410 

Ser Ser Ser Asn 
425 

Thr Glu Leu Thr 



Val Glu Ser Phe 

Pro Pro Leu Met 
160 

Met Phe Lys His 
175 

Val Val He Lys 
190 

Asp Ala Tyr His 
205 

Glu Ser He Phe 

Gly Ala Gly Leu 
240 

Leu Trp Asp Gly 
255 

Leu Met Ala Phe 
270 

Gly Asp Val Arg 
285 

Phe Pro Asn Asn 

Asn Pro He Asp 
320 

Glu Lys Asp Met 
335 

Gin Arg Thr Trp 
350 

Asn Met Glu Thr 
365 

Ser Asp Leu Leu 

Ala Val Tyr Pro 
400 

Tyr Arg Gly Phe 
415 

Trp Ala Glu Phe 
430 

Lys Thr Thr Asp 
445 
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Arg 

<210> 59 
5 <211> 449 

<212> PRT 

<213> Artificial Sequence 
<220> 

10 <223> A polypeptide encoded by SEQ ID NO: 57 



<400> 59 
Met Asn Tyr Asn Asn 
1 5 
15 Lys His Leu lie His 
20 

Thr lie Phe Ala Arg 
35 

Pro Ala Pro Gly Asp 
20 50 

He Val Ser Arg Gin 
65 

Cys Arg His Arg Gly 
85 

25 Lys Gly Phe Val Cys 
100 

Glu Leu Gin Ser Val 
115 

Asn Lys Lys Cys Leu 
30 130 

His Gly Phe lie Tyr 
145 

Asp Tyr Leu Gly Asp 
165 

35 Ser Gly Gly Leu Glu 
180 

Ala Asn Trp Lys Ala 
195 

Val Gly Trp Thr His 
40 210 



Lys He Leu Val Ser 
10 

Gly Asp Glu Glu Leu 
25 

Asn Trp Leu Phe Leu 
40 

Tyr Val Thr Ala Lys 
55 

Asn Asp Gly Ser He 
70 

Lys Thr Leu Val Ser 
90 

Ser Tyr His Gly Trp 
105 

Pro Phe Glu Lys Asp 
120 

Gly Leu Lys Glu Val 
135 

Gly Cys Phe Asp Gin 
150 

Ala Ala Trp Tyr Leu 
170 

Leu Val Gly Pro Pro 
185 

Pro Ala Glu Asn Phe 
200 

Ala Ser Ser Leu Arg 
215 



Glu Ser Gly Leu Ser Gin 
15 

Phe Gin His Glu Leu Lys 
30 

Thr His Asp Ser Leu He 
45 

Met Gly He Asp Glu Val 
60 

Arg Ala Phe Leu Asn Val 
75 80 
Val Glu Ala Gly Asn Ala 
95 

Gly Phe Gly Ser Asn Gly 
110 

Leu Tyr Gly Glu Ser Leu 
125 

Ala Arg Val Glu Ser Phe 
140 

Glu Ala Pro Pro Leu Met 
155 160 
Glu Pro Met Phe Lys His 
175 

Gly Lys Val Val He Lys 
190 

Val Gly Asp Ala Tyr His 
205 

Ser Gly Glu Ser He Phe 
220 
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Ser Ser Leu Ala Gly Asn Ala Ala Leu Pro Pro Glu Gly Ala Gly Leu 
225 230 235 240 

Gin Met Thr Ser Lys Tyr Gly Ser Gly Met Gly Val Leu Trp Asp Gly 
245 250 255 

5 Tyr Ser Gly Val His Ser Ala Asp Leu Val Pro Glu Leu Met Ala Phe 
260 265 270 

Gly Gly Ala Lys Gin Glu Arg Leu Asn Lys Glu lie Gly Asp Val Arg 

275 280 285 

Ala Arg lie Tyr Arg Ser His Leu Asn Cys Thr Val Phe Pro Asn Asn 
10 290 295 300 

Ser Met Leu Thr Cys Ser Gly Val Phe Lys Val Trp Asn Pro lie Asp 
305 310 315 320 

Ala Asn Thr Thr Glu Val Trp Thr Tyr Ala lie Val Glu Lys Asp Met 
325 330 335 

15 Pro Glu Asp Leu Lys Arg Arg Leu Ala Asp Ser Val Gin Arg Thr Tyr 
340 345 350 

Gly Pro Ala Gly Phe Trp Glu Ser Asp Asp Asn Asp Asn Met Glu Thr 

355 360 365 

Ala Ser Gin Asn Gly Lys Lys Tyr Gin Ser Arg Asp Ser Asp Leu Leu 
20 370 375 380 

Ser Asn Leu Gly Phe Gly Glu Asp Val Tyr Gly Asp Ala Val Tyr Pro 
385 390 395 400 

Gly Val Val Gly Lys Ser Ala lie Gly Glu Thr Ser Tyr Arg Gly Phe 
405 410 415 

25 Tyr Arg Ala Tyr Gin Ala His Val Ser Ser Ser Asn Trp Ala Glu Phe 
420 425 430 

Glu His Ala Ser Ser Thr Trp His Thr Glu Leu Thr Lys Thr Thr Asp 
435 440 445 

Arg 

30 

<210> 60 
<211> 26 
<212> DNA 
35 <213> Artificial Sequence 

<220> 

<223> An oligonucleotide 



40 



<400> 60 



90 

gttcagcgaa cgggcgggcc tgctgg 

<210> 61 
<211> 26 
5 <212> DNA 

<213> Artificial Sequence 

<220> 

<223> An oligonucleotide 

10 

<400> 61 
gttcagcgaa cggccgggcc tgctgg 

<210> 62 
15 <211> 26 

<212> DNA 

<213> Artificial Sequence 
<220> 

20 <223> An oligonucleotide 

<400> 62 
gttcagcgaa cgaccgggcc tgctgg 

25 <210> 63 

<211> 26 
<212> DNA 

<213> Artificial Sequence 

30 <220> 

<223> An oligonucleotide 

<400> 63 
gttcagcgaa cgatcgggcc tgctgg 

35 

<210> 64 
<211> 26 
<212> DNA 

<213> Artificial Sequence 

40 



<220> 

<223> An oligonucleotide 

<400> 64 
5 gttcagcgaa cgtgggggcc tgctgg 

<210> 65 
<211> 25 
<212> DNA 
10 <213> Artificial Sequence 

<220> 

<223> An oligonucleotide 



15 <400> 65 

ttcagcgaac gtacgggcct gctgg 



